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(57) ABSTRACT 

The disclosed embodiments provide cyanobacteria spp. that 
have been genetically engineered to have increased produc- 
tion of carbon-based products of interest. These genetically 
engineered hosts efficiently convert carbon dioxide and light 
into carbon-based products of interest such as long chained 
hydrocarbons. Several constructs containing polynucleotides 
encoding enzymes active in the metabolic pathways of cyano- 
bacteria are disclosed. In many instances, the cyanobacteria 
strains have been further genetically modified to optimize 
production of the carbon-based products of interest. The opti- 
mization includes both up-regulation and down-regulation of 
particular genes. 
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FIG. 7. Mass spectra for linalool standard 
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FIG. 8. Mass spectra for linalool produced by engineered Anabaena 

(see FIG. 6B) 
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FIG. 9. Hydrocarbons produced by Anabaena cylindrica 29414 
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GENETICALLY ENGINEERED 
CYANOBACTERIA 

CROSS-REFERENCE TO RELATED 
APPLICATIONS 

This application claims priority to U.S. Provisional Patent 
Application Ser. No. 61/446,366, filed Feb. 24, 2011, and 
U.S. Provisional Patent Application Ser. No. 61/522,685, 
filed Aug. 11, 2011, the entire contents of each of which are 
incorporated herein by reference. 

STATEMENT REGARDING FEDERALLY 
SPONSORED RESEARCH OR DEVELOPMENT 

This invention was made with U.S. Government Support 
from the following agencies: USDA (Grant #SA1100114), 
NSF (Grant #CBET1133951), and NASA (Grant 
#NNX1 1AM03A). The U.S. Government has certain rights 
in this invention. 

TECHNICAL FIELD 

The present disclosure relates to compositions and meth- 
ods for the production of carbon-based products of interest 
such as biofuels and high value chemicals by genetically 
engineered cyanobacteria hosts. The genetically engineered 
cyanobacteria hosts are optimized for use in production of 
carbon-based products of interest by strengthening endog- 
enous metabolic pathways of cyanobacteria. In certain 
instances, competing metabolic pathways are down-regu- 
lated. Methods of making and using the genetically engi- 
neered cyanobacteria hosts are also described. 

BACKGROUND 

Many existing photoautotrophic organisms are poorly 
suited for industrial bioprocessing and have therefore not 
demonstrated commercial viability. Although aquatic photo- 
autotrophs, such as cyanobacteria, may exhibit rapid growth 
rates and efficient photosynthetic pathways, giving them tre- 
mendous potential for sustainable production of carbon- 
based products of interest from only CO,, N,, and sunlight, 
they have not yet been optimized for production. Such organ- 
isms typically require large amounts of water usage as well as 
time and energy to harvest biomass. Therefore, a need exists 
to modify existing photoautotrophhosts such that these draw- 
backs can be overcome. 

SUMMARY 

The present disclosure includes compositions and methods 
for the production of carbon based products of interest using 
genetically modified cyanobacteria such as Anabaena spp. In 
certain embodiments, the Anabaena spp. are Anabaena 
PCC7 120, Anabaena cylindrica 29414, or Anabaena variabi- 
lis ATCC29413. In one aspect of the disclosure, the Ana- 
baena spp. is the ethanol producing Anabaena sp. PCC7120 
(pZR672) strain deposited under ATCC accession number 
PTA- 12833 orthe linalool produc'mgAnahaena sp. PCC7120 
(pZR808) strain deposited under ATCC accession number 
PTA- 12832. Generally the Anabaena spp. is genetically engi- 
neered by expression of at least one recombinant polynucle- 
otide expression construct comprising an enzyme capable of 
increasing production of a carbon based product of interest. 

The carbon based product of interest may be ethanol or 
linolool. In many embodiments, the MEP pathway of the 


2 

Anabaena spp. is up-regulated by modifying at least one gene 
responsible for control of the MEP pathway in the Anabaena 
spp. Photosynthesis of the Anabaena spp. may also be 
increased through genetic modification. For example, a poly- 
5 nucleotide expression construct comprising a nucleotide 
sequence encoding RuBisCo and/or RuBisCo activase is con- 
templated. 

In certain embodiments, the Anabaena spp. is further 
genetically modified to produce enzymes capable of increas- 
ing specific production of ethanol or linolool. For example, in 
embodiments that specifically produce ethanol, the Ana- 
baena spp. may be genetically engineered to produce decar- 
boxylase (PDC) or alcohol dehydrogenase (ADH). In 
embodiments specifically producing linolool, the Anabaena 
spp. may be genetically engineered to produce linalool syn- 
15 thase. 

A disclosed method includes producing a genetically engi- 
neered Anabaena spp. capable of making a carbon based 
product of interest by introducing a recombinant enzyme into 
the Anabaena spp, wherein the recombinant enzyme can par- 
20 ticipate in the Anabaena spp’s natural metabolic pathway, 
and modifying at least one competing metabolic pathway to 
increase production of the carbon based product of interest. In 
one disclosed aspect, the Anabaena spp. is the ethanol pro- 
ducing Anabaena sp. PCC7120 (pZR672) strain deposited 
25 under ATCC accession number PTA-12833 or the linalool 
producing Anabaena sp. PCC7120 (pZR808) strain depos- 
ited under ATCC accession number PTA-12832. The natural 
metabolic pathway may be the MEP pathway or the photo- 
synthetic pathway and the carbon based product of interest 

may be ethanol or linalool. 

30 J 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 demonstrates the presumptive cyanobacterial car- 
bon metabolic pathways for production of biofuels and high 
35 value chemicals. 

FIG. 2 demonstrates the modified cyanobacterial carbon 
metabolic pathway for production of ethanol. 

FIG. 3 is ethanol productivity in genetically engineered 
Anabaena as measured by HPLC. 

40 FIG. 4 shows (A) the known MEP pathway as it exists in 
plants, algae and some bacterial and (B) the proposed syn- 
thetic pathway in cyanobacteria. 

FIG. 5 shows metabolic pathway for photosynthetic pro- 
duction of sucrose. 

45 FIG. 6 shows (B) linalool production in genetically engi- 
neered Anabaena as measured by GC/MS and (C) native 
production of long chain alkanes/alkenes in wild-type Ana- 
baena sp. PCC7120. 

FIG. 7 shows mass spectra for linalool (C 10 H 18 O) stan- 
50 dard. 

FIG. 8 shows mass spectra for linalool produced by engi- 
neered Anabaena. 

FIG. 9 shows hydrocarbons produced by Anabaena cylin- 
drica 29414. 

55 FIG. 1 0 shows engineering N 2 -fixing cyanobacteria to pro- 

duce urea using solar energy. 

FIG. 11 demonstrates sucrose produced by Anabaena sp. 
PCC7120. 

FIG. 12 illustrates a LinS gene integrated to Anabaena 
60 chromosome at loci A and B. 

FIG. 13 shows a table of the MEP pathway genes in cyano- 
bacteria. 

DETAILED DESCRIPTION 

65 

For describing invention herein, the exemplary embodi- 
ments in detail, it is to be understood that the embodiments 
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are not limited to particular compositions or methods, as the 
compositions and methods can, of course, vary. It is also to be 
understood that the terminology used herein is for the purpose 
of describing particular embodiments only, and is not 
intended to be limiting. Unless defined otherwise, all techni- 5 
cal and scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art to 
which an embodiment pertains. Many methods and compo- 
sitions similar, modified, or equivalent to those described 
herein can be used in the practice of the current embodiments to 
without undue experimentation. 

As used in this specification and the appended claims, the 
singular forms “a,” “an” and “the” can include plural referents 
unless the content clearly indicates otherwise. Thus, for 
example, reference to “a cytokine” can include a combination 1 5 
of two or more cytokines. The term “or” is generally 
employed to include “and/or,” unless the content clearly dic- 
tates otherwise. 

As used herein, “about,” “approximately,” “substantially,” 
and “significantly” will be understood by person of ordinary 20 
skill in the art and will vary in some extent depending on the 
context in which they are used. If there are uses of the term 
which are not clear to persons of ordinary skill in the art given 
the context in which it is used, “about” and “approximately” 
will mean plus or minus <10% of particular term and “sub- 25 
stantially” and “significantly” will mean plus or minus >10% 
of the particular term. 

The term “polynucleotide” refers to a polymeric form of 
nucleotides of at least 10 bases in length. The term includes 
DNA molecules (e.g., cDNA or genomic or synthetic DNA) 30 
and RNA molecules (e.g., mRNA or synthetic RNA), as well 
as analogs of DNA or RNA containing non-natural nucleotide 
analogs, non-native intemucleoside bonds, or both. The 
nucleic acid can be in any topological conformation. For 
instance, the nucleic acid can be single-stranded, double- 35 
stranded, triple-stranded, quadruplexed, partially double- 
stranded, branched, hairpinned, circular, or in a padlocked 
conformation. An “isolated” polynucleotide is one which is 
substantially separated from other cellular components that 
naturally accompany the native polynucleotide in its natural 40 
host cell, e.g., ribosomes, polymerases and genomic 
sequences with which it is naturally associated. 

Polynucleotides may include both sense and antisense 
strands of RNA, cDNA, genomic DNA, and synthetic forms 
and mixed polymers of the above. They may be modified 45 
chemically or biochemically or may contain non-natural or 
derivatized nucleotide bases, as will be readily appreciated by 
those of skill in the art. Such modifications include, for 
example, labels, methylation, substitution of one or more of 
the naturally occurring nucleotides with an analog, inter- 50 
nucleotide modifications such as uncharged linkages (e.g., 
methyl phosphonates, phosphotriesters, phosphoramidates, 
carbamates, etc.), charged linkages (e.g., phosphorothioates, 
phosphorodithioates, etc.), pendent moieties (e.g., polypep- 
tides), intercalators (e.g., acridine, psoralen, etc.), chelators, 55 
alkylators, and modified linkages (e.g., alpha anomeric 
nucleic acids, etc.) Also included are synthetic molecules that 
mimic polynucleotides in their ability to bind to a designated. 

In certain embodiments, the polynucleotides are modified 
such that they contain preferential codon sequence for the 60 
host. 

Hie term “percent sequence identity” or “identical” in the 
context of polynucleotide sequences refers to the residues in 
the two sequences which are the same when aligned for 
maximum correspondence. The term “substantial homology” 65 
or “substantial similarity,” when referring to a polynucle- 
otide, indicates that, when optimally aligned with appropriate 
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nucleotide insertions or deletions with another polynucle- 
otide (or its complementary strand), there is nucleotide 
sequence identity in at least about 76%, 80%, 85%, at least 
about 90%, and at least about 95%, 96%, 97%, 98% or 99% 
of the nucleotide bases, as measured by any well-known 
algorithm of sequence identity. 

A heterologous sequence is a sequence that is in a different 
position or in a different amount than what is found in nature, 
whether or not the heterologous sequence is itself endog- 
enous (originating from the same host cell or progeny thereof) 
or exogenous (originating from a different host cell or prog- 
eny thereof). 

A recombinant molecule is a molecule, e.g., a gene or 
protein, that (1) has been removed from its naturally occur- 
ring environment, (2) is not associated with all or a portion of 
a polynucleotide in which the gene is found in nature, (3) is 
operably linked to a polynucleotide which it is not linked to in 
nature, or (4) does not occur in nature. In many embodiments, 
the recombinant molecule is an enzyme. The term “recombi- 
nant” can be used in reference to cloned DNA isolates, chemi- 
cally synthesized polynucleotide analogs, or polynucleotide 
analogs that are biologically synthesized by heterologous 
systems, as well as proteins and/or mRNAs encoded by such 
nucleic acids. A coding sequence is considered “recombi- 
nant” if it contains an insertion, deletion or a point mutation 
introduced artificially, e.g., by human intervention. 

Molecules are “operably linked” if there is a functional 
relationship between two parts in which the activity of one 
part (e.g., the ability to regulate transcription) results in an 
action on the other part (e.g., transcription of the sequence). 
Thus, a polynucleotide is “operably linked to a promoter” 
when there is a functional linkage between a polynucleotide 
expression control sequence (such as a promoter or other 
transcription regulation sequences) and a second polynucle- 
otide sequence (e.g., a heterologous polynucleotide), where 
the expression control sequence directs transcription of the 
polynucleotide. 

An “expression vector” or “construct” refers to a series of 
polynucleotide elements that are capable of transporting the 
polynucleotide elements into the host and permitting tran- 
scription of a gene in a host cell. Most embodiments require 
that the host have activity of the gene product as a conse- 
quence of being genetically engineered with an expression 
vector. For example, if the expression vector includes poly- 
nucleotide elements encoding a gene for an enzyme, the 
enzyme should have enzymatic activity after the host is 
genetically engineered. Typically, the expression vector 
includes a promoter and a heterologous polynucleotide 
sequence that is transcribed. Expression vectors or constructs 
may also include, e.g., transcription termination signals, 
polyadenylation signals, and enhancer elements. Constructs 
may also include polynucleotides that make them tempera- 
hire sensitive, antibiotic resistant, or chemically inducible. 
Expression vectors can replicate autonomously, or they can 
replicate by being inserted into the genome of the host cell. In 
exemplary embodiment, the construct encoding the desired 
enzyme is present on a “plasmid,” which generally refers to a 
circular double stranded DNA loop into which additional 
DNA segments may be ligated, but also includes linear 
double-stranded molecules such as those resulting from 
amplification by the polymerase chain reaction (PCR) or 
from treatment of a circular plasmid with a restriction 
enzyme. 

The term “recombinant host cell” or “engineered host cell” 
(or simply “host cell” or “host”) refers to a cell into which a 
recombinant polynucleotide has been introduced. Recombi- 
nant polynucleotides can be used to transform a variety of 
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hosts to produce a carbon-based product of interest. The host 
must be “competent to express,” such that it provides a suffi- 
cient cellular environment for expression of endogenous and/ 
or exogenous polynucleotides. A recombinant host cell may 
be an isolated cell or cell line grown in culture or may be a cell 
which resides in a living tissue or organism. Photoautotrophic 
organism hosts include organisms such as eukaryotic plants 
and algae, as well as prokaryotic cyanobacteria, green-sulfur 
bacteria, green non-sulfur bacteria, purple sulfur bacteria, 
and purple non-sulfur bacteria. 

In embodiments, the engineered cell of the invention is an 
algae and/or cyanobacterial organism selected from the group 
consisting of Acanthoceras, Acanthococcus, Acaryochloris, 
Achnanthes, Achnanthidium, Actinastrum, Actinochloris, 
Actinocyclus, Actinotaenium, Amphichrysis, Amphidinium, 
Amphikrikos, Amphipleura, Amphiprora, Amphithrix, 
Amphora, Anabaena, Anabaenopsis, Aneumastus, Anki- 
strodesmus, Ankyra, Anomoeoneis, Apatococcus, Aphani- 
zomenon, Aphanocapsa, Aphanochaete, Aphanothece, Api- 
ocystis, Apistonema, Arthrodesmus, Artherospira, 
Ascochloris, Asterionella, Asterococcus, Audouinella, Aula- 
coseira, Bacillaria, Balbiania, Bambusina, Bangia, 
Basichlamys, Batrachospermum, Binuclearia, Bitrichia, 
Blidingia, Botrdiopsis, Botrydium, Botryococcus, Botry- 
osphaerella, Brachiomonas , Brachysira, Brachytrichia. 
Brebissonia, Bidbochaete, Bumilleria, Bumilleriopsis, Calo- 
neis, Calothrix, Campylodiscus, Capsosiphon, Carteria, Cat- 
ena, Cavinula, Centritractus, Centronella, Ceratium, Cha- 
etoceros, Chaetochloris, Chaetomorpha, Chaetonella. 
Chaetonema, Chaetopeltis, Chaetophora, Chaeto- 
sphaeridium, Chamaesiphon, Cham, Characiochloris, 
Characiopsis, Characium, Chorales, Chilomonas, Chlaino- 
monas, Chlamydoblepharis, Chlamydocapsa, Chlamydomo- 
nas, Chlamydomonopsis , Chlamydomyxa, Chlamydo- 
nephris, Chlorangiella, Chlorangiopsis, Chlorella, 
Chlorobotrys , Chlorobrachis, Chlorochytrium, Chlorococ- 
cum, Chlorogloea, Chlorogloeopsis, Chlorogonium, Chlo- 
rolobion, Chloromonas, Chlorophysema, Chlorophyta, Chlo- 
rosaccus, Chlorosarcina, Choricystis, Chromophyton, 
Chromulina, Chroococcidiopsis, Chroococcus, Chroodacty- 
lon, Chroomonas, Chroothece, Chrysamoeba, Chrysapsis, 
Chrysidiastrum, Chrysocapsa, Chrysocapsella, Chrysocha- 
ete, Chrysochromulina, Chrysococcus, Chrysocrinus, Chry- 
solepidomonas, Chrysolykos, Chrysonebula, Chrysophyta, 
Chrysopyxis, Chrysosaccus, Chrysophaerella, Chrysos- 
tephanosphaera, Clodophora, Clastidium, Closteriopsis, 
Closterium, Coccomyxa, Cocconeis, Coelastrella, Coelas- 
trum, Coelosphaerium, Coenochloris, Coenoeoecus, Coeno- 
cystis, Colacium, Coleochaete, Collodictyon, Compsogonop- 
sis, Compsopogon, Conjugatophyta, Conochaete, 
Coronastmm, Cosmarium, Cosmioneis, Cosmocladium, 
Crateriportula, Craticula, Crinalium, Crncigenia, Cracig- 
eniella, Cryptoaulax, Cryptomonas, Cryptophyta, Cteno- 
phora, Cyanodictyon, Cyanonephron, Cyanophora, Cyano- 
phyta, Cyanotheee, Cyanothomonas, Cyclonexis, 
Cyclostephanos, Cyclotella, Cylindroeapsa, Cylindrocystis, 
Cylindrospermum, Cylindrotheea, Cymatopleura , Cymbella, 
Cymbellonitzschia, Cystodinium Dactyl ococcopsis, 

Debarya, Denticula, Dermatochrysis, Dennocarpa, Der- 
mocarpella, Desmatractum, Desmidium, Desmococcus, Des- 
monema, Desmosiphon, Diacanthos, Diacronema, Diades- 
tnis, Diatoma, Diatomella, Dicellula, Dichothrix, 
Dichotomococcus, Dicranochaete, Dictyochloris , Dictyo- 
coccus, Dictyosphaerium, Didymocystis, Didymogenes, 
Didymosphenia, Dilabifilum, Dimorphococcus , Dinobryon, 
Dinococcus, Diplochloris, Diploneis, Diplostauron, Distri- 
onella, Docidium, Draparnaldia, Dunaliella, Dysmorphoc- 
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occus, Ecballocystis, Elakatothrix, Ellerbeckia, Encyonema, 
Enteromorpha, Entocladia, Entomoneis, Entophysalis, 
Epichn’sis, Epipyxis, Epithemia, Eremosphaera, Euastrop- 
sis, Euastrum, Eucapsis, Eucocconeis, Eudorina, Euglena, 
5 Euglenophyta, Eunotia, Eustigmatophyta, Eutreptia, Falla- 
cia, Fischeretta, Fragilaria , Fragilariforma, Franceia, Fms- 
tulia, Curcilla, Geminella, Genicularia, Glaucocystis, Glau- 
cophyta, Glenodiniopsis, Glenodinium, Gloeocapsa, 
Gloeochaete, Gloeochrysis, Gloeococcus, Gloeocystis, 
10 Gloeodendron, Gloeomonas, Gloeoplax, Gloeothece, Gloeo- 
tila, Gloeotrichia, Gloiodictyon, Golenkinia, Golenldniopsis , 
Gomontia, Gomphocymbella, Gomphonema, Gomphos- 
phaeria, Gonatozygon, Gongrosia, Gongrosira, Goniochlo- 
ris, Gonium, Gonyostomum, Granulochloris, Granulocys- 
15 topsis, Groenbladia, Gymnodinium, Gymnozyga, 
Gyrosigma, Haematococcus, Hafniomonas, Hallassia, Ham- 
matoidea, Hannaea, Hantzschia, Hapalosiphon, Haplotae- 
nium, Haptophyta, Haslea, Hemidinium, Hemitoma, Herib- 
audiella, Heteromastix, Heterothrix, Hibberdia, 
20 Hildenbrandia, Hillea, Holopedium, Homoeothrix, Horman- 
thonema, Honnotila, Hyalobrachion, Hyalocardium, Hyalo- 
discus, Hyalogonium, Hvalotheca, Hvdrianum, Hydrococ- 
cus, Hydrocoleum, Hvdrocoryme, Hydrodictyon, Hydrosera, 
Hydrurus, Hyel/a, Hymenomonas, Isthmochloron, Johannes- 
25 baptistia, Juranyiella, Karayevia, Kathablepharis, Katod- 
inium, Kephyrion, Keratococcus, Kirchneriella, Klebsor- 
midium, Kolbesia, Koliella, Komarekia, Korshikoviella, 
Kraskella, Lagerheimia, Lagynion, Lamprothamnium, Lema- 
nea, Lepocinclis, Leptosira, Lobococcus, Lobocystis, 
30 Lobomonas, Luticola, Lyngbya, Malleochloris, Mallomonas, 
Mantoniella, Marssoniella, Martyana, Mastigocoleus, Gas- 
togloia, Melosira, Merismopedia, Mesostigma, Mesotae- 
nium, Micractinium, Micrasterias, Microchaete, Microco- 
leus, Micro cystis, Microglena, Micromonas, Microspora, 
35 Microthamnion, Mischococcus, Monochrysis, Monodus, 
Monomastix, Monoraphidium, Monostroma, Mougeotia, 
Mougeotiopsis, Myochloris, Myromecia, Myxosarcina, Nae- 
geliella, Nannochloris, Nautococcus, Navicula, Neglectella , 
Neidium, Nephroclamys, Nephrocytium, Nephrodiella, 
40 Nephroselmis, Netrium, Nitella, Nitellopsis, Nitzschia, Nodu- 
laria, Nostoc, Ochromonas, Oedogonium, Oligochaeto- 
phora, Onychonema, Oocardium, Oocystis, Opephora, 
Ophiocytium, Orthoseira, Oscillatoria, Oxyneis, Pachycla- 
della, Palmella, Palmodictyon, Pnadorina, Pannus, Paralia, 
45 Pascherina, Paulschulzia, Pediastrum, Pedinella, Pedinomo- 
nas, Pedinopera, Pelagodictyon, Penium, Peranema, Peri- 
diniopsis, Peridinium, Peronia, Petroneis, Phacotus, Phacus, 
Phaeaster, Phaeodermatium , Phaeophyta, Phaeosphaera, 
Phaeothamnion, Phormidium, Phycopeltis, Phyllariochloris, 
50 Phyllocardium, Phyllomitas, Pinnularia, Pitophora, Placo- 
neis, Planctonema, Planktosphaeria, Planothidium, Plec- 
tonema, Pleodorina, Pleurastrum, Pleurocapsa, Pleuro- 
cladia, Pleurodiscus , Pleurosigma, Pleurosira, 

Pleurotaenium, Pocillomonas, Podohedra, Polyblepharides, 
55 Poly chaetophora, Polyedriella, Polyedriopsis, Polygonio- 
chloris, Polyepidomonas , Polytaenia, Polytoma, Polytomella, 
Porphyridium, Posteriochromonas, Prasinochloris, Prasin- 
ocladus, Prasinophyta, Prasiola, Prochlorphyta, Prochloro- 
thrix, Protoderma, Protosiphon, Provasoliella, Prymnesium, 
60 Psammodictyon, Psammothidium, Pseudanabaena, Pseude- 
noclonium, Psuedocarteria, Pseudochate, Pseudoch- 
aracium, Pseudococcomyxa, Pseudodictyosphaerium , 
P seudokephyrion, Pseudoncobyrsa, Pseudoquadrigula, 
Pseudosphaerocystis, Pseudostaurastrum, Pseudostauro- 
65 sira, Pseudotetrastrum, Pteromonas, Punctastruata, Pyram- 
ichlamys, Pyramimonas, Pyrrophyta, Quadrichloris , Quad- 
ricoccus, Quadrigula, Radiococcus, Radiofilum, 
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Raphidiopsis, Raphidocelis, Raphidonema, Raphidophyta, 
Peimeria, Rhabdoderma, Rhabdomonas, Rhizoclonium, 
Rhodomonas, Rhodophyta, Rhoicosphenia, Rhopalodia, 
Rivularia, Rosenvingiella, Rossithidium, Rova, Scenedes- 
mus, Scherffelia, Schizochlamydella, Schizochlamys, Schi- 
zomeris, Schizothrix, Schroederia, Scolioneis, Scotiella, 
Scotiellopsis, Scourfieldia, Scytonema, Selenastrum, Sele- 
nochloris, Sellaphora, Semiorbis, Siderocelis, Diderocystop- 
sis, Dimonsenia, Siphononema, Sirocladium, Sirogonium, 
Skeletonema, Sorastrum, Spennatozopsis, Sphaerellocystis, 
Sphaerellopsis, Sphaerodinium, Sphaeroplea, Sphaerozo- 
sma, Spiniferomonas, Spirogyra, Spirotaenia, Spirulina, 
Spondylomorum, Spondylosium, Sporotetras, Spumella, 
Staurastrum, Stauerodesmus, Stauroneis, Staurosira, Stauro- 
sirella, Stenopterobia, Stephanocostis, Stephanodiscits, 
Stephanoporos , Stephanosphaera , Stichococcus , Stichog- 
loea, Stigeoclonium, Stigonema, Stipitococcus, Stokesiella, 
Strombomonas , Stylochrysalis, Stylodinium, Styloyxis, Sty- 
losphaeridium, Surirella, Sykidion, Symploca, Synechococ- 
cus, Synechocystis , Synedra, Synochromonas , Synura, Tabel- 
laria, Tabularia, Teilingia, Temnogametum, Tetmemorus, 
Tetrachlorella, Tetracyclus, Tetradesmus, Tetraedriella, Tet- 
raedron, Tetraselmis, Tetraspora, Tetrastrum, Thalassiosira, 
Thamniochaete, Thermosynechococcus, Thorakochloris, 
Thorea, Tolypella, Tolypothrix, Trachelomonas , Trachydis- 
cus, Trebouxia, Trentepholia, Treubaria, Tribonema, Tri- 
chodesmium, Trichodiscus, Trochiscia, Tryblionella, Ulo- 
thrix, Uroglena, Uronema, Urosolenia, Urospora, Uva, 
Vdcuolaria, Vaucheria, Volvox, Volvulina, Westella, 
Woloszynskia, Xanthidium, Xanthophyta, Xenococcus, 
Zygnema, Zygnemopsis, and Zygonium. In yet other related 
embodiments, the engineered cell provided by the invention 
is derived from a Chloroflexus , Chloronema, Oscillochloris, 
Heliothrix, Herpetosiphon, Roseiflexus, and Thermomicro- 
bium cell; a green sulfur bacteria selected from: Chlorobium, 
Clathrochloris, andProsthecochloris ; a purple sulfur bacteria 
is selected from: Allochromatium, Chromatium, Halochro- 
matium, Iso chromatium, Marichromatium, Rhodovulum, 
Thermochromatium, Thiocapsa, Thiorhodococcus, and Thio- 
cystis ; a purple non-sulfur bacteria is selected from: 
Phaeospirillum, Rhodobaca, Rhodobacter, Rhodomicro- 
bium, Rhodopila, Rhodopseudomonas , Rhodothalassium, 
Rhodospirillum, Rodovibrio, and Roseospira\ an aerobic 
chemolithotrophic bacteria selected from: nitrifying bacteria. 
Nitrobacteraceae sp., Nitrobacter sp., Nitrospina sp., Nitro- 
coccus sp., Nitrospira sp., Nitrosomonas sp., Nitrosococcus 
sp., Nitrosospira sp., Nitrosolobus sp., Nitrosovibrio sp.; col- 
orless sulfur bacteria suchas, Ihiovulum sp., Thiobacillus sp., 
Thiomicrospira sp., Thiosphaera sp., Thermothrix sp.; 
obligatory chemolithotrophic hydrogen bacteria, Hydrogeno- 
bacter sp., iron and manganese-oxidizing and/or depositing 
bacteria, Siderococcus sp., and magnetotactic bacteria, 
Aquaspirillum sp; an archaeobacteria selected from: metha- 
nogenic archaeobacteria, Methanobacterium sp., Methano- 
brevibaeter sp., Methanothermus sp., Methanococcus sp., 
Methanomicrobium sp., Methanospirillum sp., Methanoge- 
nium sp., Methanosarcina sp., Methanolobus sp., Methano- 
thrix sp., Methanococcoides sp., Methanoplanus sp.; 
extremely thermophilic sulfur-Metabolizers such as Thermo- 
proteus sp., Pyrodictium sp., Sulfolobus sp., Acidiamis sp., 
Bacillus subtilis, Saccharomyces cerevisiae, Streptomyces 
sp., Ralstonia sp., Rhodococcus sp., Corynebacteria sp., 
Brevibacteria sp., Mycobacteria sp., and oleaginous yeast; 
and extremophile selected from Pyrolobus fumarii; Synecho- 
coccus lividis, mesophiles, psychrophiles, Psychrobacter, 
insects, Deinococcus radiodurans, piezophiles, barophiles , 
hypergravity tolerant organisms, hypogravity tolerant organ- 
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isms, vacuum tolerant organisms, tardigrades, insects, 
microbes seeds, dessicant tolerant anhydrobiotic organisms, 
xerophiles, Artemia salina , nematodes, microbes, fungi, 
lichens, salt tolerant organisms halophiles, halobacteriacea, 
5 Dunaliella salina , pH tolerant organisms, alkaliphiles, 
N atronobacterium , Bacillus firmus OF4, Spirulina spp., aci- 
dophiles, Cyanidium caldarium, Ferroplasma sp., anaerobes, 
which cannot tolerate 0 2 , Methanococcus jannaschii, 
microaerophils, which tolerate some 0 2 , Clostridium, aer- 
10 obes, which require 0 2 , gas tolerant organisms, which toler- 
ate pure C0 2 , Cyanidium caldarium , metal tolerant organ- 
isms, metalotolerants, Ferroplasma acidarmanus Ralstonia 
sp CH34. 

15 In certain embodiments, the host is Nostoc punctiforme 
ATCC29133. In many embodiments, the host is an Anabaena 
spp of cyanobacterium. Anabaena provides several advan- 
tages above the cyanobacteria currently being genetically 
modified to produce carbon based products of interest. For 
20 example, Anabaena is capable of fixing its own N 2 for growth 
using heterocysts using only solar energy and water, allowing 
for less investment for growth. In one embodiment, the host is 
Anabaena PCC7120 ( Anabaena 7120). In another embodi- 
ment, the host is Anabaena cylindrica 29414. In yet another 
25 embodiment, the host is Anabaena variabilis ATCC29413. 

“Carbon-based products of interest” include alcohols such 
as ethanol, propanol, methylbutenol, linalool, geraniol, iso- 
propanol, butanol, butanetriol, menthol, fatty alcohols, fatty 
acid esters, wax esters; hydrocarbons (alkanes/alkenes) such 
30 as propane, hexane, octane/octane, squalane, myrcene, 
decene, pinene, farnesene, limonene, diesel, Jet Propellant 8 
(JP8); polymers such as terephthalate, 1,3-propanediol, 1,4- 
butanediol, polyols, Polyhydroxyalkanoates (PHA), poly- 
beta-hydroxybutyrate (PHB), acrylate, adipic acid, .epsilon. - 
35 caprolactone, isoprene, caprolactam, rubber; commodity 
chemicals such as lactate, Docosahexaenoic acid (DHA), 
3-hydroxypropionate, amino acids such as lysine, serine, 
aspartate, and aspartic acid, sorbitol, ascorbate, ascorbic acid, 
isopentenol, lanosterol, omega-3 DHA, itaconate, 1,3-buta- 
40 diene, ethylene, propylene, succinate, citrate, citric acid, 
sucrose, glutamate, malate, 3-hydroxypropionic acid (HPA), 
lactic acid, THF, gamma butyrolactone, pyrrolidones, 
hydroxybutyrate, glutamic acid, levulinic acid, acrylic acid, 
malonic acid; specialty chemicals including carotenoids such 
45 as lycopene, astaxanthin, |3-carotene, and canthaxanthin, iso- 
prenoids, itaconic acid; pharmaceuticals and pharmaceutical 
intermediates such as 7-aminodeacetoxycephalosporanic 
acid (7-ADCA)/cephalosporin, erythromycin, polyketides, 
statins, paclitaxel, docetaxel, terpenes, peptides, steroids, 
50 omega fatty acids and other such suitable products of interest. 
Such products are useful in the context of biofuels, i.e. any 
fuel with one or more hydrocarbons, one or more alcohols, 
one or more fatty esters or a mixture thereof that derives from 
a biological source industrial and specialty chemicals, as 
55 intermediates used to make additional products, such as nutri- 
tional supplements, neutraceuticals, polymers, paraffin 
replacements, personal care products and pharmaceuticals. 

In various embodiments, polynucleotides encoding 
enzymes are introduced into the host cell such that expression 
60 of the enzyme by the host under certain conditions results in 
increased production of a carbon-based product of interest. In 
certain cases, introduction takes place through transforma- 
tion of the host. “Increased production” or “up-regulation” of 
a carbon-based product of interest includes both augmenta- 
65 tion of native production of the carbon-based product of inter- 
est as well as production of a carbon-based product of interest 
in an organism lacking native production. For example, in 
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some instances production will be increased from a measur- 
able initial value whereas in other instances the initial value is 
zero. 

A recombinant expression construct for transformation of 
a host cell and subsequent integration of the gene(s) of inter- 
est is prepared by first isolating the constituent polynucle- 
otide sequences. In some embodiments, the gene(s) of inter- 
est are homologously integrated into the host cell genome. In 
other embodiments, the genes are non-homologously inte- 
grated into the host cell genome. Generally, constructs con- 
taining polynucleotides are introduced into the host cell using 
a standard protocol, such as the one set out in Golden S S et al. 
(1987) “Genetic engineering of the Cyanobacteria chromo- 
some” Methods Enzymol 153: 215-231 and in S. S. Golden 
and L. A. Sherman, J. Bacteriol. 158:36 (1984), incorporated 
herein by reference. The particular procedure used to intro- 
duce the genetic material into the host cell for expression is 
not particularly critical. Any of the well-known procedures 
for introducing heterologous polynucleotide sequences into 
host cells can be used. In certain embodiments, only a single 
copy of the heterologous polynucleotide is introduced. In 
other embodiments, more than a single copy, such as two 
copies, three copies or more than three copies of the heter- 
ologous polynucleotide is introduced. As is understood by the 
skilled artisan, multiple copies of heterologous polynucle- 
otides may be on a single construct or on more than one 
construct. 

In exemplary embodiments, the disclosed polynucleotides 
are operably connected to a promoter in the construct. As is 
understood in the art, a promoter is segment of DNA which 
acts as a controlling element in the expression of that gene. In 
one embodiment, the promoter is a native Anabaena pro- 
moter. For example, the promoter may be an Anabaena Pnir 
promoter such as the one described in Desplancq, D2005, 
Combining inducible protein overexpression with NMR- 
grade triple isotope labeling in the cyanobacterium Anabaena 
sp. PCC 7120. Biotechniques. 39:405-1 1 (SEQ ID NO. 1 ) or 
one having sequence identity of about 76%, 80%, 85%, at 
least about 90%, and at least about 95%, 96%, 97%, 98% or 
99% to SEQ ID NO. 1 . The promoter may also be an Ana- 
baena psbA promoter (SEQ ID NO. 2), Prbc z promoter (SEQ 
ID NO. 3) and/or E. coli P tac promoter (SEQ ID NO. 4) (Elhai. 
J. 1993. Strong and regulated promoters in the cyanobacte- 
rium Anabaena PCC 7120. FEMS Microbiol Lett. 114(2): 
179-84) or one having sequence identity of about 76%, 80%, 
85%, at least about 90%, and at least about 95%, 96%, 97%, 
98% or 99% to SEQ ID NO. 2, SEQ ID NO. 3, or SEQ ID NO. 
4. In some embodiments, the promoter is a combined dual 
promoter, i.e. a promoter containing more than one of the 
above. 

In some embodiments, the gene of interest is transiently 
introduced into the host cell through use of a plasmid or 
shuttle vector. In other embodiments, the gene of interest is 
permanently introduced into the chromosome of the host cell. 
Chromosomal integration techniques are known to the skilled 
artisan and have been described in Zhou and Wolk, 2002, 
Identification of an Akinete Marker Gene in Anabaena vari- 
abilis, J. Bacteriol., 184(9):2529-2532. Briefly, the gene of 
interest is fused to a promoter and then subcloned into an 
integration vector. This construct is introduced into the host 
cell for double homologous recombination at specific loci on 
the host cell chromosome. In many embodiments, homolo- 
gous recombination takes place at two loci of the host cell 
chromosome. The recombinant cells can be selected by moni- 
toring loss of a conditional lethal gene, such as sacB. Further 
diagnostic verification by the polymerase chain reaction can 
be perfonned. In many embodiments, the gene of interest will 
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be inserted into the chromosome at the site of a gene that is 
desired to be deleted or inactivated. 

After the host is genetically modified, the host is generally 
incubated under conditions suitable for production of the 
5 carbon-based product of interest. Culture conditions for vari- 
ous hosts are well documented in the literature. Typically, 
when the host is Anabaena , the host cell will be grown in a 
photoauto trophic liquid culture in BG-11 media, with an 1 
L/min air sparge rate and a pH set point of 7.5, controlled via 
to sparging with CO,, and the temperature maintained at 30° C. 

In many embodiments, strain engineering techniques such 
as directed evolution and acclimation will be used to improve 
the performance of various host cells. Strain engineering is 
known in the art (Hughes, S. R., K. M. Bischoff, W. R. 
15 Gibbons, S. S. Bang, R. Pinkelman, P. J. Slininger, N. 
Qureshi, S. Liu, B. C. Saha, J. S. Jackson, M. C. Cotta, J. O. 
Rich, and J. Javers. 2011. Random UV-C Mutagenesis of 
Scheffersomyces (formerly Pichia ) stipitis NRRL Y-7124 to 
Improve Anaerobic Growth on Lignocellulosic Sugars. J. Ind. 
20 Microbiol. Biotechnol. DOI 10.1007/x 10295-01 1-1012-x; 
Bock, S. A., Fox, S. L. and Gibbons. W. R. 1997. Develop- 
ment of a low cost, industrially suitable medium for produc- 
tion of acetic acid from glucose by Clostridium thermoaceti- 
cum. Biotechnol. Applied Bioch. 25: 1 1 7-1 25; Gibbons, W. R., 
25 N. Pulseher, and E. Ringquist. 1992. Sodium meta bisulfite 
and pH tolerance of Pleurotus sajor caju under submerged 
cultivation. Appl. Biochem. Biotechnol. 37:177-189. 

As host cells generally possess complex regulatory sys- 
tems for traits such as product tolerance, productivity, and 
30 yield, directed evolution and screening is often used to create 
global genome-wide alterations needed to develop strains 
with desired industrial characteristics. Certain embodiments 
will use directed evolution under elevated linalool concentra- 
tions, as well as fluctuating temperature, pH, and C0 2 /0 2 
35 levels to generate stable, heritable genetic improvements in 
product tolerance, productivity, yield, and robustness to pro- 
cess conditions. 

A. Ethanol 

In one embodiment, the host cell is genetically engineered 
40 to increase production of ethanol through transformation with 
an expression vector containing polynucleotides encoding 
ethanol producing enzymes. As used herein, an ethanol pro- 
ducing enzyme is an enzyme active in the end production of 
ethanol from a precursor molecule in a metabolic pathway. 
45 The polynucleotide encodes pyruvate decarboxylase (SEQ 
ID NO. 5) and/or alcohol dehydrogenase (SEQ ID NO. 6) in 
exemplary embodiments. Embodiments also include 
enzymes having sequence identity of about 76%, 80%, 85%, 
at least about 90%, and at least about 95%, 96%, 97%, 98% or 
50 99% to SEQ ID NO. 5 and SEQ ID NO. 6. The host is 
genetically engineered with polynucleotides encoding one or 
both enzymes. In many embodiments, host cells are engi- 
neered to express both enzymes. Known sources of poly- 
nucleotides encoding pyruvate decarboxylase and alcohol 
55 dehydrogenase exist. For example, the nucleic acid encoding 
the enzymes may be from organisms such as Zymomonas 
mobilis, Zymobacter paimae, or Saccharomyces cerevisciae 
(Ingram L O, Conway T, Clark D P, Sewell G W, Preston J F. 
1987. Genetic engineering of ethanol production in Escheri- 
60 chia coli. Appl Environ Microbiol. 53(1 0):2420-5). Any 
pyruvate decarboxylase (pdc) gene capable of expression in 
the host may be used in with the disclosed embodiments. In 
some embodiments, the pdc gene is the Zymomonas mobilis 
pdc gene. In these embodiments, the pdc gene is often 
65 obtained from the Zymomonas mobilis plasmid pLOI295. In 
other embodiments, the pdc gene is from Zymobacter 
paimae. The NCBI accession number for the complete pdc 
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protein sequence from Zymobacter paimae is AF474145. 
Similarly, any alcohol dehydrogenase (adh) gene capable 
expression in the host may be used with the disclosed embodi- 
ments. In some embodiments, the adh gene is the Zymomonas 
mobilis adhll gene. In these embodiments, the adh gene is 5 
often obtained from the Zymomonas mobilis plasmid 
pLOI295. 

Polynucleotides encoding genes such as omrA, ImrA, and 
ImrCD, which increase the ability of the host to handle com- 
mercially relevant amounts of ethanol, may be included on to 
the same or a different vector as the polynucleotides encoding 
the pdc and adh genes. Bourdineaud J P, Nelnne B, Tesse S, 
Lonvaud-Funel A. 2004. A bacterial gene homologous to 
ABC transporters protect Oenococcus oeni from ethanol and 
other stress factors in wine. Int. J. Food Microbiol. 92( 1 ): 1 - 15 
1 4 . For example, in some embodiments, the expression vector 
comprising the pdc/adh genes further comprises the omrA 
gene. In other embodiments, the expression vector compris- 
ing the pdc/adh genes further comprises the ImrA gene. In yet 
other embodiments, the expression vector comprising the 20 
pdc/adh genes further comprises the ImrCD gene. And in still 
further embodiments, the expression vector comprising the 
pdc/adh genes further comprises polynucleotides encoding 
the omrA, ImrA, and ImrCD genes. 

In host cells producing increased ethanol, the synthesis of 25 
pyruvate is additionally up-regulated in certain embodiments. 

In these embodiments, phosphohoglycerate mutase, enolase, 
and/or pyruvate kinase, are over-expressed. A construct con- 
taining genes of one or more of the above enzymes is designed 
using genes from Z. mobilis and S. cerevisiae. The construct 30 
is then used to genetically engineer a host. 

Ethanol producing Anabaena sp. PCC7120 (pZR672) 
strain was deposited at the American Type Culture Collection 
on Feb. 27, 2012, and given accession number PTA-12833. 
PTA- 12833 was deposited with the American Type Culture 35 
Collection ATCC at 10801 University Blvd. Manassas Va. 
20110-2209 USA. The deposit was made under the provi- 
sions of the Budapest Treaty on the International Recognition 
of Deposited microorganisms for the Purposes of Patent Pro- 
cedure and Regulations thereunder Budapest Treaty). Main- 40 
tenance of a viable culture is assured for thirty years from the 
date of deposit. The organism will be made available by the 
ATCC under the terms of the Budapest Treaty, and subject to 
an agreement between the Applicants and the ATCC which 
assures unrestricted availability of the deposited cells to the 45 
public upon the granting of patent from the instant applica- 
tion. 

B. Sucrose 

In yet another embodiment, the host cell is engineered to 
increase the production and excretion of sucrose through 50 
transformation with an expression vector containing poly- 
nucleotides encoding sucrose producing enzymes. As used 
herein, a sucrose producing enzyme is an enzyme active in the 
end production of sucrose from a precursor molecule in a 
photosynthetic pathway. In these embodiments, a polynucle- 55 
otide encoding sucrose-phosphate synthase (SPS) and/or 
sucrose-phosphate phosphatase (SPP) is introduced into the 
host cell and expressed such that the host cell increases its 
production of sucrose. Known sources of SPS and SPP exist 
and any SPS or SPP gene capable of expression may be used 60 
with the disclosed embodiments. For example, polynucle- 
otide encoding SPS and SPP may be from organisms such as 
sugar beet and sugar cane such as those in SEQ ID NO. 7, 
SEQ ID NO. 8, SEQ ID NO. 9. In other embodiments, the 
polynucleotides have sequence identity of about 76%, 80%, 65 
85%, at least about 90%, and at least about 95%, 96%, 97%, 
98% or 99% to SEQ ID NO. 7, SEQ ID NO. 8, and SEQ ID 
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NO. 9. In an alternative embodiment, the polynucleotide 
encoding SPS and is from cyanobacteria such as Synchocys- 
tis, Anabaena , or the like. Polynucleotides of SPS from 
cyanobacteria are shown in SEQ ID NO. 10 and SEQ ID NO. 
11. In certain embodiments, SPS polynucleotides have 
sequence identity of about 76%, 80%, 85%, at least about 
90%, and at least about 95%, 96%, 97%, 98% or 99% to SEQ 
ID NO. 10 and SEQ ID NO. 1 1 . 

In exemplary embodiments, the expression vector encod- 
ing SPS and/or SPP includes a promoter. For example, in 
some embodiments, the expression vector includes an Ana- 
baena PpsbA promoter. In this embodiment the expression 
vector may be shuttle vector pRL489, such as the one 
described in Elhai J 1993 Strong and regulated promoters in 
the cyanobacterium Anabaena PCC7120. FEMS Microbiol. 
Lett. 114(2): 179-84. 

In many embodiments where sucrose production has been 
increased, intracellular sucrose concentrations are reduced by 
over-expression of sucrose exporter genes. A sucrose 
exporter gene is a gene encoding a polypeptide involved in the 
transport of sucrose out of the cell. An example sucrose 
exporter gene includes the sucrose exporter gene from maize, 
i.e. ZmSUTl (Slewinski et al., 2009. Sucrose transporter 1 
functions in phloem loading in maize leaves. J. Exp. Bot. 60 
(3):881 -892). A sucrose exporter gene is demonstrated by 
SEQ ID NO. 12. In some embodiments, the sucrose exporter 
genes have sequence identity of about 76%, 80%, 85%, at 
least about 90%, and at least about 95%, 96%, 97%, 98% or 
99% to SEQ ID NO. 12. The host in certain embodiments is 
genetically engineered with a sucrose exporter gene which is 
on the same construct as SPS and/or SPP. In other embodi- 
ments, the sucrose exporter genes may be from sugarcane and 
cloned into a separate expression vector or integrated into the 
chromosome of the host cells. Reinders A, Sivitz A B, Hsi A, 
Grof C P, Perroux J M, Ward J M. 2006. Sugarcane ShSUTl : 
analysis of sucrose transport activity and inhibition by sucral- 
ose. Plant Cell Environ. 29( 10): 1 87 1 -80 demonstrates the 
sucrose exporter gene of SEQ ID NO. 13. In exemplary 
embodiments, the sucrose exporter genes have sequence 
identity of about 76%, 80%, 85%, at least about 90%, and at 
least about 95%, 96%, 97%, 98% or 99% to SEQ ID NO. 13 

C. Urea 

Additionally, other urea cycle pathway intermediates are 
up-regulated and non-urea producing metabolic pathways are 
down-regulated or blocked in exemplary embodiments. For 
example, in one embodiment the urea cycle genes, i.e. CPS-1 , 
OTC, ASS, and AS, are up -regulated. Polynucleotides encod- 
ing the genes are operably connected to an Anabaena PglnA 
promoter and the host cell is genetically engineered with the 
constnict. 

D. Long Chain Alkanes 

In still another embodiment, host cells are engineered to 
increase production of long chain hydrocarbons such as 
alkanes/alkenes, i.e. C8-C18. In many embodiments with 
increased production of long chain hydrocarbons, secretion 
of the long chain hydrocarbons is also increased. Anabaena is 
innately capable of producing and secreting long-chain 
alkanes/alkenes. Long chain alkanes/alkenes can be pro- 
duced in Anabaena from both the fatty acid pathway and the 
MEP pathway. In the fatty acid pathway, acyl-ACP reductase 
(AR) combined with aldehyde decarbonylase (AD) convert 
fatty acid to alkanes/alkenes Schirmer A, Rude M A, Li X, 
Popova E, del Cardayre S B. 2010. Microbial biosynthesis of 
alkanes. Science. 329(5991 ):559-62. In embodiments where 
host cells are engineered to increase production of long chain 
alkanes, the host cell is genetically engineered with a poly- 
nucleotide encoding AR and/or AD. Known sources of AR 
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and AD exist in many cyanobacteria and any AR and AD gene 
capable of expression may be used with the disclosed 
embodiments. In many embodiments, the AR and/or AD 
genes are native Anabaena genes, i.e. native AR and/or AD 
are over-expressed. For example, in one embodiment the 
AR/AD genes will be from Anabaena cylindrica 29414 such 
as those demonstrated by SEQ ID NO. 14 and SEQ ID NO. 
15. In other embodiments, the AR and AD genes have 
sequence identity of about 76%, 80%, 85%, at least about 
90%, and at least about 95%, 96%, 97%, 98% or 99% to SEQ 
ID NO. 14 and SEQ ID NO. 15. 

E. Long-Chain Hydrocarbons from Isoprenoid Biosynthe- 
sis Pathway 

In still another embodiment, the host cell is engineered to 
increase the production of carbon-based products of interest 
from the native isoprenoid biosynthesis pathway, i.e. the 2-C- 
methyl-D-erythritol 4-phosphate (MEP) pathway. In many 
embodiments, excretion of the carbon-based products of 
interest is also increased. DMAPP and IPP, the early precur- 
sors for many carbon-based products of interest are made 
through MEP pathway in Anabaena. In heterotrophic organ- 
isms, DMAPP and IPP are made from precursors mainly 
derived from glucose through gluconeogenesis. However, as 
demonstrated in FIG. 4 photosynthetic organisms produce 
DMAPP and IPP from precursors directly synthesized from 
C0 2 via the Calvin cycle and perhaps also from photorespi- 
ration. Cyanobacteria, in addition to initiating the MEP path- 
way via glyceraldehyde-3-phosphate (G3P) and pyruvate, 
can use phosphorylated sugars directly from the Calvin cycle 
as precursors for entering into the MEP pathway. Due to their 
higher photosynthetic efficiency and greater innate MEP 
pathway flux for making DMAPP and IPP precursors, cyano- 
bacteria, such as Anabaena are especially suited for engineer- 
ing production of excreted carbon-based products of interest. 
Therefore, genetically engineering photosynthetic organisms 
such as Anabaena to produce MEP pathway carbon-based 
products of interest has greater advantages than genetically 
engineering heterotrophic organisms. 

In some embodiments, components of the MEP pathway 
are up-regulated to manipulate the DMAPP and IPP pool so 
as to maximize production of carbon-based products of inter- 
est. This up-regulation is achieved through transformation of 
the host by an expression vector with polynucleotides con- 
taining one or more of the eight genes of the MEP pathway. 
FIG. 4 and FIG. 13 show the individual components of the 
MEP pathway. The genes responsible for the MEP pathway 
include dxs, dxr, met, cmk, mds, hds, hdr, and idi. In many 
cases, the MEP pathway polynucleotide expression may be 
constructed as a synthetic operon. This operon is fused to an 
Anabaena psbA promoter in pZR807 (a pNIR derivative 
shuttle vector) in many embodiments. In certain embodi- 
ments, the dxr, hds, and hdr are from Synechocysitis sp. 
PCC6803. In Synechocysitis , the corresponding genes are 
sllOOl 9, slr2136, and slr0348 respectively. In another 
embodiment, DXS will be overexpreesed. Kuzuyama T, 
Takagi M, Takahashi S, Seto H.2000. Cloning and character- 
ization of 1-deoxy-D-xylulose 5-phosphate synthase from 
Streptomyces sp strain CL190, which uses both the meva- 
lonate and nomnevalonate pathways for isopentenyl diphos- 
phate biosynthesis. J. Bacteriol. 182(4):891-7, Cordoba E, 
Salmi M, Leon P. 2009. Unravelling the regulatory mecha- 
nisms that modulate the MEP pathway in higher plants. J Exp 
Bot. 60(10):2933-43, Alper H, Fischer C, Nevoigt E, Stepha- 
nopoulos G. 2005. Tuning genetic control through promoter 
engineering. Proc. Natl. Acad. Sci. USA. 102:12678-83, 
Alper H, Stephanopoulos G. 2008. Uncovering the gene 
knockout landscape for improved lycopene production in E. 
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coli. Appl. Microbiol. Biotechno/. 78:801-10. In this embodi- 
ment, to overexpress DXS, the DXS gene (alr0599) from 
Anabaena will be PCR amplified with primers containing 
restriction sites and a ribosome binding site. The resulting 
5 PCR product will be fused to a nitrate-inducible promoter 
Pnir and cloned into pZR807, a shuttle plasmid that can 
replicate both in E. coli and Anabaena. This construction will 
be introduced into Anabaena for overexpression of DXS. 

The genes of the MEP pathway are generally placed into 
1 0 the operon in the pathway order, although this is not required. 
The genes may be flanked with restriction nuclease sites 
non-native to the applicable genes to make insertion and 
deletion of specific genes more convenient. When the restric- 
tion sites are intended to allow removal of a portion of the 
15 operon and replacement with another sequence, different 
restriction enzyme sites are used on each side of the portion of 
the operon. When the restriction sites are intended to allow 
removal of a portion of the operon and not be replaced, the 
same restriction nuclease site exists on both sides. In most 
20 embodiments, restriction nuclease sites are engineered to 
produce sticky-ends. Polynucleotide sequences for indi- 
vidual genes have engineered ribosome binding sites in many 
embodiments. In some instances, the genes additionally 
include spacer sequences for enhancing translation of target 
2:1 genes. 

a. Linalool (C 10 H 18 O) 

Linalool (C 10 H 18 O) is a carbon-based product of interest 
produced from the MEP pathway where the universal iso- 
prenoid intermediate geranyl disphosphate (GPP) is con- 
30 verted to linalool by linalool synthase (LinS) (see FIG. 4 ). In 
these embodiments, host is genetically engineered with a 
polynucleotide encoding LinS such that the host cell has 
up-regulated production of linalool. Known sources of LinS 
genes exist and any LinS gene capable of being expressed 
may be used with the disclosed embodiments. For example, 
polynucleotide encoding LinS may be from a Norway 
Spruce. In many embodiments, the polynucleotide encoding 
LinS is not native to Anabaena. LinS genes such as CbLinS, 
McLinS, and LaLinS are well studied and contemplated for 
40 use in the disclosed embodiments. 

TABLE 1 


Genes required for linalool production in engineering cyanobacteria 


45 


Gene 

Accession 

Km 




name 

No. 

(jjM) Organism 

References 


linalool 

LaLINS 

DQ263741 

47.4 Lavandula 

Landmann 


synthase 



angustifolia 

et al., 2007 

50 


Me Lis 

AY083653 

25 Mentha 

Crowell et al., 




citrata 

2002 



CbLis 

U58314 

0.9 Clarkia 

Pichersky 





breweri 

et al., 1995 
Dudareva 
et al., 1996 


55 


In exemplary embodiments, the expression vector encod- 
ing LinS includes a promoter. For example, in some embodi- 
ments, the expression vector includes an Anabaena Pnir pro- 
moter. In this embodiment the expression vector may be a 
60 shuttle vector pZR807. 

In many embodiments, a host cell is genetically engineered 
with both polynucleotide encoding genes of the MEP path- 
way as well as LinS . This transformation may include a single 
expression vector or multiple expression vectors. In other 
65 embodiments, a LinS gene is fused to a promoter and then 
subcloned into an integration vector and this resulting con- 
struction pLinS is then introduced into the host cell for double 
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homologous recombination. The double recombinants are 
then selected by loss of a conditional lethal gene such as sacB . 

Linalool producing Anabaena sp. PCC7120 (pZR808) 
strain was deposited at the American Type Culture Collection 
on Feb. 27, 2012, and given accession number PTA-12832. 5 
PTA-12832 was deposited with the American Type Culture 
Collection (ATCC) at 10801 University Blvd., Manassas, Va. 
20110-2209 (USA). The deposit was made under the provi- 
sions of the Budapest Treaty on the International Recognition 
of Deposited microorganisms for the Purposes of Patent Pro- to 
cedure and Regulations thereunder (Budapest Treaty). Main- 
tenance of a viable culture is assured for thirty years from the 
date of deposit. The organism will be made available by the 
ATCC under the terms of the Budapest Treaty, and subject to 
an agreement between the Applicants and the ATCC which 15 
assures unrestricted availability of the deposited cells to the 
public upon the granting of patent from the instant applica- 
tion. 

b. Methylbutenol (C 5 H 10 O) 

Another carbon-based product of interest produced by an 20 
intermediate product from the MEP pathway, i.e. DMAPP, is 
methylbutenol (MBO). Methylbutenol is produced in the 
MEP pathway when DMAPP is converted to methylbutenol 
by methylbutenol synthase (MboS). In these embodiments, 
host cell is genetically engineered with a polynucleotide 25 
encoding MboS such that the host cell has up-regulated pro- 
duction of methylbutenol. Known sources of MboS exist and 
any MboS gene capable of being expressed may be used with 
the disclosed embodiments. In certain embodiments, the 
polynucleotide encoding MboS is from Pinus sabiniana and 30 
listed as below. Gray D W, Breneman S R, Topper L A, 
Sharkey T D. 201 1 , Biochemical characterization and homol- 
ogy modeling of methylbutenol synthase and implications for 
understanding hemiterpene synthase evolution in plants. J 
Biol. Chem. 286(23):20582-90. SEQ ID NO. 16. In other 35 
embodiments, MboS have sequence identity of about 76%, 
80%, 85%, at least about 90%, and at least about 95%, 96%, 
97%, 98% or 99% to SEQ ID NO. 16. 

In many embodiments, a host cell is genetically engineered 
with both polynucleotide encoding genes of the MEP path- 40 
way as well as MboS. This transformation may include a 
single expression vector or multiple expression vectors. 

c. Myrcene (C 10 H 16 ) 

Yet another carbon-based product of interest produced 
from an intermediate of the MEP pathway is myrcene. 45 
Myrcene is produced in the MEP pathway where the univer- 
sal isoprenoid intermediate geranyl disphosphate (GPP) is 
converted to myrcene by myrcene synthase (MyrS) Dudareva 
N, Martin D, Kish C M, Kolosova N, Gorenstein N, Faldt J, 
Miller B, Bohlmami J. 2003. (E)-beta-ocimene and myrcene 50 
synthase genes of floral scent biosynthesis in snapdragon: 
function and expression of three terpene synthase genes of a 
new terpene synthase subfamily. Plant Cell. 15(5): 1227-41. 
Martin D M, Faldt J, Bohlmami J. 2004. Functional charac- 
terization of nine Norway Spruce TPS genes and evolution of 55 
gymnosperm terpene synthases of the TPS-d subfamily. Plant 
Physiol. 135(4):1908-27. LijimaY, Davidovich-Rikanati R, 
Fridman E, Gang D R, Bar E, Lewinsohn E, Pichersky E. 
2004. The biochemical and molecular basis for the divergent 
patterns in the biosynthesis of terpenes and phenylpropenes 60 
in the peltate glands of three cultivars of basil. Plant Physiol. 
136(3):3724-36. No MyrS gene is founded in cyanobacterial 
genomes. In these embodiments, host is genetically engi- 
neered with a polynucleotide encoding MyrS such that the 
host cell has increased production of myrcene. Known 65 
sources of MyrS exist and any MyrS gene capable of being 
expressed may be used with the disclosed embodiments. In 


many embodiments, the polynucleotides encoding MyrS may 
be chosen from the organisms listed in the following table: 

TABLE 2 


Myrcene synthase gene required for engineering 
cyanobacteria to produce myrcence 


Gene 

Name 

Accession 

No. 

Organism 

Myrcene synthase Ag.2. 

U87908 

Abies grandis 

(MyrS) 



Amale20 

AA041726 

Antirrhinum majus 

PaTPs-Myr 

AY473626 

Norway Spruce 

MyS 

AAV63791 

Ocimum basilicum 

Ama0cl5 

AY195608 

Snapdragon 


In many embodiments, a host cell is genetically engineered 
with both polynucleotide encoding genes of the MEP path- 
way as well as MyrS. This transformation may include a 
single expression vector or multiple expression vectors. 

d. Famesene (C 15 H 24 ) 

And still another carbon based product of interest produced 
by MEP pathway is farnesene. Famesene is produced in the 
MEP pathway by conversion of geranyl-diphosphate (GPP) 
to farnesyl -diphosphate (FPP) by FPP synthase (FPPS). Sub- 
sequently, FPP is converted to farnesene by farnesene syn- 
thase (FarS) Maruyama T, Ito M, Honda G. 2001 . Molecular 
cloning, functional expression and characterization of (E)- 
beta famesene synthase from Citrus junos. Biol. Pharm. Bull. 
24:1 171-5 and Picaud S, Brodelius M, Brodelius P E. 2005. 
Expression, purification and characterization of recombinant 
(E)-beta -famesene synthase from Artemisia annua. Phy- 
tochemistry. 66(9):961 -7. In Anabaena, only a putative FPPS 
gene exists and no FarS gene is found. In these embodiments, 
host cell is genetically engineered with a polynucleotide 
encoding FPPS and FarS such that the host cell has increased 
production of farnesene. Known sources of FPPS and FarS 
exist and any FPPS or FarS gene capable of being expressed 
may be used with the disclosed embodiments. In many 
embodiments, the polynucleotides encoding FPPS and FarS 
are chosen from the organisms listed in the following table: 

TABLE 3 


Genes required for engineering cyanobacteria to produce farnesene 



Gene 

Name 

Accession 

No. 

Organism 

Farnesyl 

FDSI 

AY308477 

Artemisis tridentate 

diphosphate 
synthase (FPPS) 


TbFPPS 

AY1 58342 

Trypanosoma brucei 


FPS2 

NP_974565 

Arabidopsis thaliana 


ispA 

NP -414955 

E. coli K-12 


pFPS2 

U20771 

Lupinus albus 

Famesene synthase 

AFS1 

AY1 82241 

Malus domestica 

(FarS) 


CJFS 

AF374462 

Citrus junos 


CmTpsDul 

EU1 58099 

Cucumis melo L. 


FS 

AY835398 

Artemisis annua 


PmeTPS4 

AY906867 

Pseudotsuga 

menziesii 


In certain embodiments, the FPPS and FarS will be from 
the same organism. In other embodiments, the constmcts will 
include FPPS and FarS from different organisms. In many 
embodiments, a host cell is genetically engineered with both 
polynucleotide encoding genes of the MEP pathway as well 
as FPPS and FarS. This transformation may include a single 
expression vector or multiple expression vectors. 
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In most embodiments, production of carbon-based prod- 
ucts of interest is further optimized. For example, photosyn- 
thesis is optimized and/or competing metabolic pathways are 
blocked or inactivated. Photosynthetic rates can be increased 
by the over-expression of RuBisCo and RuBisCo activase. 5 
Hudson G S, Evans J R, von Caemmerer S, Arvidsson Y B, 
Andrews T J. 1992. Reduction of Ribulose-l,5-Bisphosphate 
Carboxylase/ Oxygenase Content by Antisense RNA Reduces 
Photosynthesis in Transgenic Tobacco Plants. Plant Physiol. 

98, 294-302 and Peterhansel C, Niessen M, Kebeish R M. 
2008. Metabolic engineering towards the enhancement of 
photosynthesis. Photochem. Photobiol. 84:1317-23. In 
embodiments where host cells producing the carbon-based 
products of interest using C0 2 and H 2 0 as the starting mate- 
rial, the hosts are often additionally genetically engineered 
with polynucleotides encoding RuBisCo and RuBisCo acti- 15 
vase. 

When carbon-based products of interest are produced from 
the MEP pathway, glycogen synthesis, which competes with 
the MEP metabolic pathway in the host is down-regulated or 
blocked in many embodiments. Glycogen synthesis is down- 20 
regulated or blocked by the down-regulation or block of ADP- 
glucose pyrophosphorylase (ADP-GPPase) activity. Pyru- 
vate dehydrogenase (PDH) is additionally or alternatively 
blocked in these embodiments. GPP flux may be optimized 
by downregulating farnesyl-disphosphate synthase (FPPS). 25 
Additionally, in certain embodiments genes for the tolerance 
of a host cell to economically relevant concentrations of the 
carbon based product of interest are included. In embodi- 
ments where competing carbon pathways are blocked or par- 
tially inactivated, this may be done using any method known 3< 1 
in the art. For example, enzymes in competing pathways can 
be knocked out or have their activity blocked or reduced. In 
certain embodiments, unmarked gene deletion created by 
double-crossover to delete target genes is used to delete Ana- 
baena genes. 35 


EXAMPLES 


The invention may be further clarified by reference to the 
following Examples, which serve to exemplify some of the 40 
embodiments and not to limit the invention in any way. The 
experiments were performed using the methodology 
described below. 


Example 1 45 

Conjugation 

Briefly, host cells are harvested by centrifugation and re- 
suspended in medium at a concentration of about 2-5xl0 8 50 
cells per ml. To one ml of this cell solution is added the 
appropriate construct to a final concentration of 2 pg/ml. Host 
cells are incubated in the dark for 8 hours followed by a 16 h 
light incubation prior to plating on media plates containing 
antibiotic. Plates are incubated under standard growth condi- 55 
tions (30° C. light intensity of 100 pmol photons m-2 S-l). 
Antibiotic resistant colonies are chosen and the genetically 
modified host cells are grown, bubbling with air at 30° C. and 
a light intensity of 100 pmol photons m-2 S-l in liquid 
medium containing an appropriate antibiotic 60 

Example 2 


Culture Growth 


65 

Transgenic cyanobacter cultures will be grown in liquid 
BG-1 1 medium in a lighted shaker (Innova 44R, New Brun- 


swick Scientific) at 30° C. and 150 pmol photons m-2 s-l. 
One week-old cultures will be used to re-inoculate 500 ml 
Erlemneyer flasks containing 1 00 ml liquid BG1 1 , which will 
then be incubated at 30° C. and 150 pmol photons m-2 s-l 
with a 24 h lighting set. Heterotrophic cultures will be supple- 
mented with 100 g L-l glucose. Samples will be collected at 
regular intervals and analyzed for product production, as well 
as chlorophyll content. Chlorophyll will be measured with a 
spectrophotometer. 


Example 3 
Ethanol Production 

Both pdc„ m and adhB„„, coding regions, with an engineered 
optimized SD sequence (ribosome binding site) immediately 
upstream of their initiation codons were PCR amplified from 
pLOI295, which contains both pdc_„, and adhB„„, in an arti- 
ficial operon. See Ingram L O et al . 1 987 Genetic Engineering 
of Ethanol Production in Escherichia coll. Appl. Environ. 
Microbiol. 53(10):2420-5. The DNA fragment was fused to 
Anabaena nitrate inducible promoter (nir) in shuttle vector. 
See Desplancq, D. et al. 2005 Combining inducible protein 
overexpression with NMR-grade triple isotope labeling in the 
cyanobacterium Anabaena sp. PCC 7120. Biotechniques. 
39:405-1 1 and Frias et al. 2000. Activation of the Anabaena 
nir operon promoter requires both NtcA (CAP family) and 
NtcB (LysR family) transcription factors. Mol. Microbiol. 
38:613-25. This construct, named pZR672, was introduced 
into Anabaena by conjugation. See Zhou, R. and Wolk, C. P. 
2002. Identification of an akinete marker gene in Anabaena 
variabilis. J Bacteriol. 184:2529-32; Wolk, C. P. et al. 1984 
Construction of shuttle vectors capable of conjugative trans- 
fer from Escherichia coli to nitrogen-fixing filamentous 
cyanobacteria, Proc Natl Acad Sci USA. 81:1561-5; and 
Zhou, R. and Wolk, C. P. 2003. A two-component system 
mediates developmental regulation of biosynthesis of a het- 
erocyst polysaccharide. JBiol. Chem. 278:19939-46. Geneti- 
cally engineered hosts were selected in a nitrate-minus (AA/8 
medium) Kan plate. Tests of ethanol production were done 
using well established protocols. Current ethanol productiv- 
ity, as shown in FIG. 3 is about 13.8 mg/liter/h/1.0A 700 , 

Example 4 
Sucrose Production 

Both sps and spp coding regions, with an engineered opti- 
mized SD sequence (ribosome binding site) immediately 
upstream of their initiation codons will be PCR amplified 
from sugarcane/ sugar beet cDNA. The DNA fragment will be 
fiised to Anabaena nitrate inducible promoter (nir) in shuttle 
vector pNIR. This construct will be introduced into Anabaena 
by conjugation. See Wolk, C. P. et al. 1984 Construction of 
shuttle vectors capable of conjugative transfer from Escheri- 
chia coli to nitrogen-fixing filamentous cyanobacteria, Proc 
Natl Acad Sci USA. 81:1561-5. Genetically transformed^a- 
baena will be selected in a nitrate-containing (AA/8 N 
medium) Km plate. Antibiotic resistant colonies will be cho- 
sen and the genetically modified host cells will be grown, 
bubbling with air at 30° C. and a light intensity of 100 pmol 
photons m-2 s-l in liquid medium containing appropriate 
antibiotic. HPLC tests of sucroseproductionby.477o/)oe;7o sp. 
PCC7120 are demonstrated in FIG. 11. 

Sucrose degradation will be reduced by blocking inver- 
tases and sucrose synthases (SuS) (see FIG. 5). Two genes, 
alr08 1 9 and alrl 52 1 , coding for Anabaena invertases and two 
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genes, all4985 and all 1059, coding for sucrose synthases will 
be inactivated in a double crossover approach, such as the one 
demonstrated in Zhou, R., Wolk, C. P. 2003. A two-compo- 
nent system mediates developmental regulation of biosynthe- 
sis of a heterocyst polysaccharide. J. Biol. Chem. 278:19939- 
46. Phosphofructokinase (PFK) will also be down-regulated 
in certain embodiments. The genes coding for Anabaena 
PFK, all7335 and alrl 91 9, will be down-regulated or knocked 
out using a double crossover approach or through expression 
of the antisense gene. In one embodiment, one PFK gene will 
be knocked out, while the other will be down-regulated. In 
another embodiment, both PFK genes will be down-regu- 
lated. 

Example 5 
Urea Production 

a. Create a novel strain with more closely spaced hetero- 
cysts. It is known that overexpression of patA gene in Ana- 
baena or inactivation of patN gene in Nostoc punctiforme led 
to more closely spaced single heterocysts, with an average 
vegetative cell interval of 3.2 cells (Meeks, J. C., E. L. Camp- 
bell, M. L. Summers, and F. C. Wong. 2002. Cellular Differ- 
entiation in the cyanobacterium Nostoc punctiforme. Arch. 
Microbiol. 178: 395-403; Liang J, Scappino L, Haselkorn R. 
1 992. The patA gene product, which contains a region similar 
to CheY of Escherichia coli, controls heterocyst pattern for- 
mation in the cyanobacterium Anabaena 7120. Proc. Natl. 
Acad. Sci. USA. 89(12):5655-9)). A novel Anabaena will be 
created by combining over-expression of patA and inactiva- 
tion of patN in Anabaena. This patA+patN- strain will serve 
as a model strain for further genetic modification to produce 
urea. 

b. Manipulate nitrogen flux in patA+patN- strain. Ana- 
baena will be engineered to convert surplus ammonia to urea. 
All 5 human homologous genes required for urea cycle are 
found in the Anabaena genome, as well as genes coding for 
urea transporters. The urea cycle’s final reaction is arginase- 
catalyzed hydrolysis of arginine to yield urea and regenerate 
ornithine (FIG. 10 ). Initially an authentic arginase LeARGl 
from tomato will be overexpressed in patA + patN _ strain and 
inactivate its urease Alr3666. Chen H, McCaig B C, Melotto 
M, He S Y, Howe G A. 2004, Regulation of plant arginase by 
wounding, jasmonate, and the phytotoxin coronatine. J. Biol 
Chem. 279(44):45998-6007. To overexpress these genes in 
Anabaena, the Anabaena PglnA, a constitutively strong pro- 
moter that functions in both vegetative cells and heterocysts, 
will be fused to urea cycle genes and followed by over- 
expression of them in the patA + patN“ urease“Le ARG + strain. 

c. Shut down the cyanophycin synthesis in patA + patN _ 
urease~LeARG + strain. Cyanophycin synthesis will be 
blocked and fixed nitrogen will be redirected to excreted urea. 
A single gene, all3879, encoding cyanophycin synthetase 
will be knocked out by a double crossover approach (Zhou R, 
Wolk C R 2003. A two-component system mediates develop- 
mental regulation of biosynthesis of a heterocyst polysaccha- 
ride. J. Biol. Chem. 278:19939-46). 

The disclosed genetically engineered urea-producing Ana- 
baena strains will be grown in a liquid N 2 -medium (Bgl 1 0 
medium which contains no combined nitrogen) in a lighted 
shaker (Innova 44R, New Brunswick Scientific) at 30° C. and 
150 pmol photons m-2 s-1. One week-old cultures will be 
used to re-inoculate 4-liter Erlemneyer flasks containing 
1 000 ml liquid BG1 1 0 , which will then be incubated at 30° C. 
and 150 pmol photons m-2 s-1 with a 24 h lighting set. 
Samples will be collected at regular intervals (24 h) and 


20 

analyzed for urea production. Urea excreted in the culture 
fluid will be measured by HPLC. Results will be used to guide 
further genetic manipulations. 

5 Example 6 

Long Chain Hydrocarbon Production and Isoprenoid 
Biosynthetic Pathway Product Production 

to a. Linalool Production 

To engineer Anabaena to produce linalool, CbLinS, 
McLinS, and LaLinS (see Table 1 ) will be transferred into 
Anabaena. The coding region of the three genes, with N-ter- 
minal plastid targeted sequence deletion, was cloned imme- 
15 diately downstream of the engineered translation initiation 
sequence (Shine-Dalargno sequence) under a dual promoter 
(Pnir/PsbA) in shuttle vector pZR807, a pNIR derived plas- 
mid that replicates 'mAnabaena. Each construct will be intro- 
duced into Anabaena by conjugation. 

20 Transgenic Anabaena cultures will be grown in liquid 
BG-1 1 medium in a lighted shaker (Innova 44R, New Brun- 
swick Scientific) at 30° C. and 150 pmol photons m-2 s-1. 
One week-old cultures will be used to re-inoculate 500 ml 
Erlemneyer flasks containing 100 ml liquid BG1 1 , which will 
25 then be incubated at 30° C. and 150 pmol photons m-2 s-1 
with a 24 h lighting set. Heterotrophic cultures will be supple- 
mented with 1 00 g L-l glucose. Samples will be collected at 
regular intervals and analyzed for linalool production, as well 
as chlorophyll content. 

30 Chlorophyll will be measured with a spectrophotometer. 
To measure volatile linalool, 2 ml culture samples will be 
placed a sealed 20 ml headspace tubes, and incubated at 30° 
C. for 2 hour. Volatiles will be sampled with a headspace 
sampler and measured by GC -MS. Linalool will be identified 
35 by comparison with genuine standard from GC-Standard 
grade liquid linalool. Linalool emission rates will be calcu- 
lated in nmol g-1 chlorophyll h-1 over 2 hour incubation by 
headspace analysis. Linalool in the culture fluid will be mea- 
sured by HPLC. Results will be used to guide further genetic 
40 manipulations. FIG. 6. demonstrates the production of lina- 
lool in transgenic Anabaena. 

b. Methylbutenol Production 

To engineer Anabaena to produce methylbutenol (MBO), 
methylbutenol synthase (MboS) will be transferred into Ana- 
45 baena. The coding region of the MboS, with N-terminal 
plastid targeted sequence deletion, was cloned immediately 
dow nstream of the engineered translation initiation sequence 
(Shine-Dalargno sequence) under a dual promoter (Pnir/ 
PsbA) in shuttle vector pZR807, a pNIR derived plasmid that 
50 replicates in Anabaena. Each construct was introduced into 
Anabaena by conjugation. Genetically engineered MBO- 
producing Anabaena strains (see above) will be grown in a 
liquid Bgl 1 medium which contains combined nitrogen in a 
lighted shaker (Innova 44R, New Brunswick Scientific) at 30° 
55 C. and 150 umol photons m-2 s- 1 . One week-old cultures will 
be used to re-inoculate 4-liter Erlemneyer flasks containing 
1 000 ml liquid BG1 1 , which will then be incubated at 30° C. 
and 150 pmol photons m-2 s-1 with a 24 h lighting set. 
Samples will be collected at regular intervals (24 h) and 
60 analyzed for MBO production. MBO excreted in the culture 
fluid will be measured by HPLC or GC/MS. Results will be 
used to guide further genetic manipulations. 

c. Myrcene Production 

To engineer Anabaena to produce myrcene, three MyrS 
65 genes in Table 2, i.e. ag2, ama0cl5, and At TPS 10 will be 
transferred into the host. The coding region of the three genes, 
with N-tenninal plastid targeted sequence deletion will be 
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cloned immediately downstream of the engineered transla- 
tion initiation sequence (Shine-Dalgarno sequence) under 
Anabaena psbA promoter (PpsbA) in shuttle vector pZR807, 
a plasmid that replicates in Anabaena and bears kanamycin 
resistance gene Kan*. The constructs will be individually 
introduced into the host by conjugation. Genetically engi- 
neered Anabaena will be selected in a nitrate-containing 
AA/N medium agar plate supplemented with kanamycin sul- 
fate. In certain experiments, a nitrate-inducible promoter will 
be used to replace the PpsbA promoter. In some experiments, 
an epitope tagged MyrS will be designed. The construct 
allows the 3' of MyrS gene in frame to link to FLAG 2 -His 6 
epitope tag engineered into the pZR807 vector once the MyrS 
gene stop codon is removed. Genetically engineered 
myrcene-producing Anabaena strains will be grown as 
described for linalool-producing strain. The myrcene produc- 
tion will measured by GC/MS as described for linalool mea- 
surement. 

d. Famesene Production 

FPPS and FarS genes from Artmisia will be constructed as 
an operon under the control of the psbA promoter in shuttle 
vectorpZR807. The construct will be individually introduced 
into Anabaena by conjugation. Genetically engineered Ana- 
baena will be selected in a nitrate-containing AA/N medium 
agar plate supplemented with kanamycin sulfate. In certain 
embodiments, a nitrate-inducible promoter will be used to 
replace the PpsbA promoter. In some embodiments, an 
epitope tagged FarS will be designed. The construct allows 
the 3' of FarS gene in frame to link to FLAG 2 -His 6 epitope tag 
engineered into the pZR807 vector once the FarS gene stop 
codon is removed. Famesene produced by engineered Ana- 
baena will be measured as described for linalool measure- 
ment. 

Example 7 

Optimization of Production of Carbon Based 
Products of Interest 

a. RuBisCo/RuBisCo Activase 

The native RuBisCo genes rbcL/S (slr009/slr001 2) and the 
putative RuBisCo activase (slrOOll) gene will be over-ex- 
pressed in hosts producing the carbon based product of inter- 
est. These three genes will be PCR amplified and fused to a 
strong Anabaena promoter PpsbA and subcloned into a 
shuttle vector for conjugation. 

FBP/SBPase will be over-expressed to boost RUBP levels. 
Hosts producing carbon based products of interest will be 
genetically engineered with FBP/SBPase from Synechococ- 
cus PCC794. See MiyagawaY, Tamoi M, Shigeoka S. 2001. 
Overexpression of a cyanobacterial fructose-1, 6-/sedoheptu- 
lose-l,7-bisphosphatase in tobacco enhances photosynthesis 
and growth. Nat. Biotechnol. 19(10):965-9 and Tamoi M, 
Nagaoka M, MiyagawaY, Shigeoka 5.2006. Contribution of 
fructose-1, 6-bisphosphatase and sedoheptulose-l,7-bisphos- 
phatase to the photo synthetic rate and carbon flow in the 
Calvin cycle in transgenic plants. Plant Cell Physiol. 47(3): 
380-90 

b. ADP-GPPase 

ADP-GPPase will be inactivated or deleted in certain 
genetically engineered Anabaena. ADP-GPPase may be 
inactivated using a double crossover knockout approach. This 
approach is well documented in Zhou R and Wolk C P. 2002 
Identification of an akinete marker gene in Anabaena vari- 
abilis. J. Bacteriol. 184:2529-32 and Zhou R and Wolk C P 
2003 A two-component system mediates developmental 
regulation of biosynthesis of a heterocyst polysaccharide. J. 
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Biol. Chem. 278:19939-46. In Anabaena, the ADP-GPPase 
gene is all4645. As shown in FIG. 12, for example, LinS gene 
fused to Anabaena promoter is subcloned to an integration 
vector (fragment A and B are from Anabaena chromosome) 
5 and this resulting construction pLinS is then introduced to 
Anabaena for double homologous recombination at loci A 
and B of Anabaena chromosome. The double recombinants 
will be selected on the sucrose/Km plate by losing the con- 
ditional lethal gene sacB in the vector portion (Cai Y P, Wolk 
10 C P. 1 990. Use of a conditionally lethal gene in Anabaena sp. 
strain PCC 7120 to select for double recombinants and to 
entrap insertion sequences. J. Bacteriol. June; 1 72(6):3 138- 
3145). The completely segregated double recombinants will 
15 be further verified by diagnostic PCR. Thus, the LinS/Km 
cassette from integration plasmid pLinS has replaced the 
gene all4645 (pink C in FIG. 12) in the double recombinants. 
In this example, gene all4645 lias been deleted from Ana- 
baena chromosome. 

20 c. PDH 

Anabaena PDH will be inactivated in some experiments. 
The internal fragment of alr4745, one of the three genes 
encoding Anabaena PDH multienzyme complex, will be 
amplified from Anabaena 7120 genomic DNA and cloned 
25 into pRL278, a plasmid designed for conjugative transfer into 
cyanobacteria. The alr4745 will be knocked out according to 
the method disclosed in Zhou R and Wolk C P 2003 A two- 
component system mediates developmental regulation of 
biosynthesis of a heterocyst polysaccharide. J. Biol. Chem. 
30 278:19939-46. 

d. GGPPS/SQS 

If a decrease in the FPP flux to terpeniods is desired, 
geranylgeranyl diphosphate synthase (GGPPS) and/or 
squalene synthase (SQS) expression will be down-regulated. 
35 SQS and or GGPS antisense sequences will be used to down- 
regulate GGPPS and/or SQS. The construct may additionally 
include an inducible promoter. The inducible promoter will 
be inducible by nitrate in many experiments. The gppS anti- 
sense sequence will be cloned downstream of a nitrate-induc- 
40 ible promoter and conjugatively transferred into hosts geneti- 
cally engineered to produce target products . Down-regulating 
GPPS will be achieved by inducing antisense RNA expres- 
sion with the addition of nitrate to the growth medium when 
cell density reaches the maximum. 

45 e. FPPS 

GPP flux will be optimized by down-regulating farnesyl- 
disphosphate synthase (FPPS). FPPS will be over-expressed 
in the antisense direction under an inducible promoter. The 
fppS antisense sequence will be cloned downstream of a 
50 nitrate-inducible promoter and conjugatively transferred into 
hosts genetically engineered to produce linalool or myrcene. 
Down-regulating FPPS is achieved by inducing antisense 
RNA expression with the addition of nitrate to the growth 
medium when cell density reaches the maximum. 

55 f. Pyruvate Synthesis 

Pynivate synthesis will be increased by over-expressing 
phosphoglycerate mutase, enolase, and pyruvate kinase (See 
FIG. 2). Three robust genes from Z. mobilis and from S. 
cerevisiae will be constructed as an artificial operon and fused 
60 to a PsbAl promoter and then cloned into an integrative 
vector to insert the enzyme genes within the coding region of 
alr4745 (encoding PDH-E3). This allows for increased syn- 
thesis of pyruvate while concurrently inactivating PDH. 

GP3 flux may be altered by over-expressing certain rate- 
65 limiting enzymes. The DXS gene (alr0599) from Anabaena 
and the Arabidopsis IDI gene (AT5G16440) will be PCR 
amplified with primers containing restriction sites and a ribo- 
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some binding site. The resulting PCR product will be fused to 
a nitrate-inducible promoter Pnir and cloned into pZR807. 

All of the references cited herein are incorporated by ref- 
erence in their entireties. 

From the above discussion, one skilled in the art can ascer- 5 
tain the essential characteristics of the invention, and without 
departing from the spirit and scope thereof, can make various 
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changes and modifications of the embodiments to adapt to 
various uses and conditions. Thus, various modifications of 
the embodiments, in addition to those shown and described 
herein, will be apparent to those skilled in the art from the 
foregoing description. Such modifications are also intended 
to fall within the scope of the appended claims. 


SEQUENCE LISTING 


<160> NUMBER OF SEQ ID NOS: 16 

<210> SEQ ID NO 1 

<211> LENGTH: 590 

<212> TYPE: DNA 

<213> ORGANISM: Anabaena sp . 

<22 0> FEATURE: 

<22 1> NAME/ KEY: misc_feature 

<223> OTHER INFORMATION: Anabaena Pnir promoter sequence 
<400> SEQUENCE: 1 


gatctagcta 

ctcattagtt 

aagtgtaatg 

cagaaaacgc 

atattctcta 

ttaaacttac 

60 

gcattaatac 

gagaattttg 

tagctactta 

tactatttta 

cctgagatcc 

cgacataacc 

120 

ttagaagtat 

cgaaatcgtt 

acataaacat 

tcacacaaac 

cacttgacaa 

atttagccaa 

180 

tgtaaaagac 

tacagtttct 

ccccggttta 

gttctagagt 

taccttcagt 

gaaacatcgg 

240 

cggcgtgtca 

gtcattgaag 

tagcataaat 

caattcaaaa 

taccctgcgg 

gaaggctgcg 

300 

ccaacaaaat 

taaatatttg 

gtttttcact 

attagagcat 

cgattcatta 

atcaaaaacc 

360 

ttacccccca 

gcccccttcc 

cttgtaggga 

agtgggagcc 

aaactcccct 

ctccgcgtcg 

420 

gagcgaaaag 

tctgagcgga 

ggtttcctcc 

gaacagaact 

tttaaagaga 

gaggggttgg 

480 

gggagaggtt 

ctttcaagat 

tactaaattg 

ctatcactag 

acctcgtaga 

actagcaaag 

540 

actacgggtg 

gattgatctt 

gagcaaaaaa 

actttatgag 

aacgaattcg 


590 


<210> SEQ ID NO 2 

<211> LENGTH: 161 

<212> TYPE: DNA 

<213 > ORGANISM: Anabaena sp . 

<22 0> FEATURE: 

<22 1> NAME/KEY: misc_f eature 

<223 > OTHER INFORMATION: Anabaena psbAl promoter sequence 
<400> SEQUENCE: 2 


ggattcccaa 

agataggggg 

aataattaac 

attaagaatt 

attaattcat 

gggtttttag 

60 

tctagtaaat 

ttgcgtgaat 

tcatgtaaat 

tttatgagac 

aggcgcaagt 

ctaaaaaaag 

120 

cgtctgaatt 

aatctgcaca 

aatccaaagc 

aatcataaaa 

a 


161 


<210> SEQ ID NO 3 

<211> LENGTH: 402 

<212 > TYPE: DNA 

<213 > ORGANISM: Anabaena sp . 

<22 0> FEATURE: 

<221> NAME/KEY: misc_f eature 

<223 > OTHER INFORMATION: Anabaena PrbcL promoter sequence 
<400> SEQUENCE: 3 


gttaacaaaa cgtttaaaac tttatgtaat aacaaattta aatatgtaag ttaagaactt 60 
tcaaagaata acttatgcca tttcttgata tattgtgaga caagttacaa attacgtggt 120 
gtgcaatttt ttcatcttgc gctgattact ctactaaata tccgtcaagt aaattggctc 180 
ttagctcgtc tcctgtcaat aaaggaggtc ggcaagagtg cagaagcggg aatgtgtgaa 240 
aactaaccca attcattaaa taccccgaaa tataggggaa tcatctcata ctttccgtaa 300 



25 


US 8,993,303 B2 


26 


-continued 


accgcgaagg tcgtgaaggg ataaaagcaa tttagtgggt gagaagaaca gataaaaaag 360 
aattttttaa ctatggcaag aggaaaaagt aaaagcgtta ac 402 


<210 > SEQ ID NO 4 
<211 > LENGTH: 244 
<212 > TYPE: DNA 
<213 > ORGANISM: E. coli 
<220 > FEATURE: 

<221> NAME / KEY : misc_feature 

<223> OTHER INFORMATION: E. coli tac promoter sequence 
<400 > SEQUENCE: 4 


cgactgcacg 

gtgaccaatg 

cttctggcgt 

caggcagcca 

tcggaagctg 

tggtatggct 

60 

gtgcaggtcg 

taaatcactg 

cataattcgt 

gtcgctcaag 

gcgcactccc 

gttctggata 

12 0 

atgttttttg 

cgccgacatc 

ataacggttc 

tggcaaatat 

tctgaaatga 

gctgttgaca 

180 

attaatcatc 

ggctcgtata 

atgtgtggaa 

ttgtgagcgg 

ataacaattt 

cacacaggaa 

240 

acag 






244 


<210 > SEQ ID NO 5 
<211 > LENGTH: 568 
<212 > TYPE: PRT 

<213 > ORGANISM: Zymomonas mobilis subsp. mobilis CP4 
<220 > FEATURE: 

<221 > NAME /KEY: misc_f eature 

<223 > OTHER INFORMATION: Pyruvate Decarboxylase 


<400 > SEQUENCE: 

5 












Met 

Ser 

Tyr 

Thr 

Val 

Gly 

Thr 

Tyr 

Leu 

Ala 

Glu 

Arg 

Leu 

Val 

Gin 

lie 

1 




5 





10 





15 


Gly 

Leu 

Lys 

His 

His 

Phe 

Ala 

Val 

Ala 

Gly 

Asp 

Tyr 

Asn 

Leu 

Val 

Leu 




20 





25 





30 



Leu 

Asp 

Asn 

Leu 

Leu 

Leu 

Asn 

Lys 

Asn 

Met 

Glu 

Gin 

Val 

Tyr 

Cys 

Cys 



35 





40 





45 




Asn 

Glu 

Leu 

Asn 

Cys 

Gly 

Phe 

Ser 

Ala 

Glu 

Gly 

Tyr 

Ala 

Arg 

Ala 

Lys 


50 





55 





60 





Gly 

Ala 

Ala 

Ala 

Ala 

Val 

Val 

Thr 

Tyr 

Ser 

Val 

Gly 

Ala 

Leu 

Ser 

Ala 

65 





70 





75 





80 

Phe 

Asp 

Ala 

lie 

Gly 

Gly 

Ala 

Tyr 

Ala 

Glu 

Asn 

Leu 

Pro 

Val 

lie 

Leu 





85 





90 





95 


He 

Ser 

Gly 

Ala 

Pro 

Asn 

Asn 

Asn 

Asp 

His 

Ala 

Ala 

Gly 

His 

Val 

Leu 




100 





105 





110 



His 

His 

Ala 

Leu 

Gly 

Lys 

Thr 

Asp 

Tyr 

His 

Tyr 

Gin 

Leu 

Glu 

Met 

Ala 



115 





120 





125 




Lys 

Asn 

lie 

Thr 

Ala 

Ala 

Ala 

Glu 

Ala 

lie 

Tyr 

Thr 

Pro 

Glu 

Glu 

Ala 


130 





135 





140 





Pro 

Ala 

Lys 

lie 

Asp 

His 

Val 

lie 

Lys 

Thr 

Ala 

Leu 

Arg 

Glu 

Lys 

Lys 

145 





150 





155 





160 

Pro 

Val 

Tyr 

Leu 

Glu 

lie 

Ala 

Cys 

Asn 

lie 

Ala 

Ser 

Met 

Pro 

Cys 

Ala 





165 





170 





175 


Ala 

Pro 

Gly 

Pro 

Ala 

Ser 

Ala 

Leu 

Phe 

Asn 

Asp 

Glu 

Ala 

Ser 

Asp 

Glu 




180 





185 





190 



Ala 

Ser 

Leu 

Asn 

Ala 

Ala 

Val 

Glu 

Glu 

Thr 

Leu 

Lys 

Phe 

lie 

Ala 

Asn 



195 





200 





205 




Arg 

Asp 

Lys 

Val 

Ala 

Val 

Leu 

Val 

Gly 

Ser 

Lys 

Leu 

Arg 

Ala 

Ala 

Gly 


210 





215 





220 
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-continued 


Ala 

Glu 

Glu 

Ala 

Ala 

Val 

Lys 

Phe 

Ala 

Asp 

Ala 

Leu 

Gly Gly Ala 

Val 

225 





230 





235 





240 

Ala 

Thr 

Met 

Ala 

Ala 

Ala 

Lys 

Ser 

Phe 

Phe 

Pro 

Glu 

Glu 

Asn 

Pro 

His 





245 





250 





255 


Tyr 

lie 

Gly 

Thr 

Ser 

Trp 

Gly 

Glu 

Val 

Ser 

Tyr 

Pro 

Gly Val 

Glu 

Lys 




260 





265 





270 



Thr 

Met 

Lys 

Glu 

Ala 

Asp 

Ala 

Val 

lie 

Ala 

Leu 

Ala 

Pro 

Val 

Phe 

Asn 



275 





280 





285 




Asp 

Tyr 

Ser 

Thr 

Thr 

Gly 

Trp 

Thr 

Asp 

lie 

Pro 

Asp 

Pro 

Lys 

Lys 

Leu 


290 





295 





300 





Val 

Leu 

Ala 

Glu 

Pro 

Arg 

Ser 

Val 

Val 

Val 

Asn 

Gly Val 

Arg 

Phe 

Pro 

305 





310 





315 





320 

Ser 

Val 

His 

Leu 

Lys 

Asp 

Tyr 

Leu 

Thr 

Arg 

Leu 

Ala 

Gin 

Lys 

Val 

Ser 





325 





330 





335 


Lys 

Lys 

Thr 

Gly 

Ala 

Leu 

Asp 

Phe 

Phe 

Lys 

Ser 

Leu 

Asn 

Ala 

Gly Glu 




340 





345 





350 



Leu 

Lys 

Lys 

Ala 

Ala 

Pro 

Ala 

Asp 

Pro 

Ser 

Ala 

Pro 

Leu 

Val 

Asn 

Ala 



355 





360 





365 




Glu 

lie 

Ala 

Arg 

Gin 

Val 

Glu 

Ala 

Leu 

Leu 

Thr 

Pro 

Asn 

Thr 

Thr 

Val 


370 





375 





380 





He 

Ala 

Glu 

Thr 

Gly 

Asp 

Ser 

Trp 

Phe 

Asn 

Ala 

Gin Arg 

Met 

Lys 

Leu 

385 





390 





395 





400 

Pro 

Asn 

Gly 

Ala 

Arg 

Val 

Glu 

Tyr 

Glu 

Met 

Gin 

Trp 

Gly His 

lie 

Gly 





405 





410 





415 


Trp 

Ser 

Val 

Pro 

Ala 

Ala 

Phe 

Gly 

Tyr 

Ala 

Val 

Gly Ala 

Pro 

Glu Arg 




420 





425 





430 



Arg 

Asn 

lie 

Leu 

Met 

Val 

Gly 

Asp 

Gly 

Ser 

Phe 

Gin 

Leu 

Thr 

Ala 

Gin 



435 





440 





445 




Glu 

Val 

Ala 

Gin 

Met 

Val 

Arg 

Leu 

Lys 

Leu 

Pro 

Val 

lie 

lie 

Phe 

Leu 


450 





455 





460 





lie 

Asn 

Asn 

Tyr 

Gly 

Tyr 

Thr 

lie 

Glu 

Val 

Met 

lie 

His 

Asp 

Gly 

Pro 

465 





470 





475 





480 

Tyr 

Asn 

Asn 

lie 

Lys 

Asn 

Trp 

Asp 

Tyr 

Ala 

Gly 

Leu 

Met 

Glu 

Val 

Phe 





485 





490 





495 


Asn 

Gly 

Asn 

Gly 

Gly 

Tyr 

Asp 

Ser 

Gly 

Ala 

Gly 

Lys 

Gly 

Leu 

Lys 

Ala 




500 





505 





510 



Lys 

Thr 

Gly 

Gly 

Glu 

Leu 

Ala 

Glu 

Ala 

lie 

Lys 

Val 

Ala 

Leu 

Ala 

Asn 



515 





520 





525 




Thr 

Asp 

Gly 

Pro 

Thr 

Leu 

lie 

Glu 

Cys 

Phe 

lie 

Gly Arg 

Glu 

Asp 

Cys 


530 





535 





540 





Thr 

Glu 

Glu 

Leu 

Val 

Lys 

Trp 

Gly 

Lys 

Arg 

Val 

Ala 

Ala 

Ala 

Asn 

Ser 

545 





550 





555 





560 

Arg 

Lys 

Pro 

Val 

Asn 

Lys 

Leu 

Leu 










565 


<210 > SEQ ID NO 6 
<211 > LENGTH: 383 
<212 > TYPE: PRT 

<213 > ORGANISM: Zymomonas mobilis CP4 
<220 > FEATURE: 

<221> NAME / KEY : misc_f eature 

<223 > OTHER INFORMATION: Alcohol Dehydrogenase II 
<400 > SEQUENCE: 6 

Met Ala Ser Ser Thr Phe Tyr lie Pro Phe Val Asn Glu Met Gly Glu 
15 10 15 
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-continued 


Gly 

Ser 

Leu 

Glu 

Lys 

Ala 

lie 

Lys 

Asp 

Leu 

Asn 

Gly 

Ser 

Gly 

Phe 

Lys 




20 





25 





30 



Asn 

Ala 

Leu 

lie 

Val 

Ser 

Asp 

Ala 

Phe 

Met 

Asn 

Lys 

Ser 

Gly 

Val 

Val 



35 





40 





45 




Lys 

Gin 

Val 

Ala 

Asp 

Leu 

Leu 

Lys 

Ala 

Gin 

Gly 

lie 

Asn 

Ser 

Ala 

Val 


50 





55 





60 





Tyr 

Asp 

Gly 

Val 

Met 

Pro 

Asn 

Pro 

Thr 

Val 

Thr 

Ala 

Val 

Leu 

Glu 

Gly 

65 





70 





75 





80 

Leu 

Lys 

He 

Leu 

Lys 

Asp 

Asn 

Asn 

Ser 

Asp 

Phe 

Val 

lie 

Ser 

Leu 

Gly 





85 





90 





95 


Gly 

Gly 

Ser 

Pro 

His 

Asp 

Cys 

Ala 

Lys 

Ala 

lie 

Ala 

Leu 

Val 

Ala 

Thr 




100 





105 





110 



Asn 

Gly 

Gly 

Glu 

Val 

Lys 

Asp 

Tyr 

Glu 

Gly 

lie 

Asp 

Lys 

Ser 

Lys 

Lys 



115 





120 





125 




Pro 

Ala 

Leu 

Pro 

Leu 

Met 

Ser 

lie 

Asn 

Thr 

Thr 

Ala 

Gly 

Thr 

Ala 

Ser 


130 





135 





140 





Glu 

Met 

Thr 

Arg 

Phe 

Cys 

lie 

lie 

Thr 

Asp 

Glu 

Val 

Arg 

His 

Val 

Lys 

145 





150 





155 





160 

Met 

Ala 

lie 

Val 

Asp 

Arg 

His 

Val 

Thr 

Pro 

Met 

Val 

Ser 

Val 

Asn Asp 





165 





170 





175 


Pro 

Leu 

Leu 

Met 

Val 

Gly 

Met 

Pro 

Lys 

Gly 

Leu 

Thr 

Ala 

Ala 

Thr 

Gly 




180 





185 





190 



Met 

Asp 

Ala 

Leu 

Thr 

His 

Ala 

Phe 

Glu 

Ala 

Tyr 

Ser 

Ser 

Thr 

Ala 

Ala 



195 





200 





205 




Thr 

Pro 

lie 

Thr 

Asp 

Ala 

Cys 

Ala 

Leu 

Lys 

Ala 

Ala 

Ser 

Met 

lie 

Ala 


210 





215 





220 





Lys 

Asn 

Leu 

Lys 

Thr 

Ala 

Cys 

Asp 

Asn 

Gly 

Lys 

Asp 

Met 

Pro 

Ala 

Arg 

225 





230 





235 





240 

Glu 

Ala 

Met 

Ala 

Tyr 

Ala 

Gin 

Phe 

Leu 

Ala 

Gly 

Met 

Ala 

Phe 

Asn 

Asn 





245 





250 





255 


Ala 

Ser 

Leu 

Gly 

Tyr 

Val 

His 

Ala 

Met 

Ala 

His 

Gin 

Leu 

Gly 

Gly 

Tyr 




260 





265 





270 



Tyr 

Asn 

Leu 

Pro 

His 

Gly 

Val 

Cys 

Asn 

Ala 

Val 

Leu 

Leu 

Pro 

His 

Val 



275 





280 





285 




Leu 

Ala 

Tyr 

Asn 

Ala 

Ser 

Val 

Val 

Ala 

Gly 

Arg 

Leu 

Lys 

Asp 

Val 

Gly 


290 





295 





300 





Val 

Ala 

Met 

Gly 

Leu 

Asp 

lie 

Ala 

Asn 

Leu 

Gly 

Asp 

Lys 

Glu 

Gly Ala 

305 





310 





315 





320 

Glu 

Ala 

Thr 

lie 

Gin 

Ala 

Val 

Arg 

Asp 

Leu 

Ala 

Ala 

Ser 

lie 

Gly 

lie 





325 





330 





335 


Pro 

Ala 

Asn 

Leu 

Thr 

Glu 

Leu 

Gly 

Ala 

Lys 

Lys 

Glu 

Asp 

Val 

Pro 

Leu 




340 





345 





350 



Leu 

Ala 

Asp 

His 

Ala 

Leu 

Lys 

Asp 

Ala 

Cys 

Ala 

Leu 

Thr 

Asn 

Pro 

Arg 



355 





360 





365 




Gin 

Gly 

Asp 

Gin 

Lys 

Glu 

Val 

Glu 

Glu 

Leu 

Phe 

Leu 

Ser 

Ala 

Phe 



370 375 380 


<210 > SEQ ID NO 7 
<211 > LENGTH: 1045 
<212 > TYPE: PRT 

<213 > ORGANISM: Beta vulgaris subsp. vulgaris 
<220 > FEATURE: 

<221> NAME / KEY : misc_f eature 

<223> OTHER INFORMATION: Sugar beet sucrose -phosphate synthase (SPS) 
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-continued 


<400 > SEQUENCE: 

7 












Met 

Ala 

Gly 

Asn 

Asp 

Trp 

lie 

Asn 

Ser 

Tyr 

Leu 

Glu 

Ala 

lie 

Leu 

Asp 

1 




5 





10 





15 


Val 

Gly 

Pro 

Gly 

Leu 

Asp 

Asp 

Ala 

Lys 

Ser 

Ser 

Leu 

Leu 

Leu 

Arg 

Glu 




20 





25 





30 



Arg 

Gly 

Arg 

Phe 

Ser 

Pro 

Thr 

Arg 

Tyr 

Phe 

Val 

Glu 

Glu 

Val 

lie 

Thr 



35 





40 





45 




Gly 

Phe 

Asp 

Glu 

Thr 

Asp 

Leu 

His 

Arg 

Ser 

Trp 

Val 

Arg 

Ala 

Gin 

Ala 


50 





55 





60 





Thr 

Arg 

Ser 

Pro 

Gin 

Glu 

Arg 

Asn 

Thr 

Arg 

Leu 

Glu 

Asn 

Met 

Cys 

Trp 

65 





70 





75 





80 

Arg 

lie 

Trp 

Asn 

Leu 

Ala 

Arg 

Gin 

Lys 

Lys 

Gin 

Leu 

Glu 

Asn 

Glu 

Glu 





85 





90 





95 


Ala 

Gin 

Arg 

Lys 

Thr 

Lys 

Arg 

Arg 

Met 

Glu 

Leu 

Glu 

Arg 

Gly 

Arg 

Arg 




100 





105 





110 



Glu 

Ala 

Thr 

Ala 

Asp 

Met 

Ser 

Glu 

Asp 

Leu 

Ser 

Glu 

Gly 

Glu 

Lys 

Asp 



115 





120 





125 




He 

Ser 

Ala 

His 

Gly 

Asp 

Ser 

Thr 

Arg 

Pro 

Arg 

Leu 

Pro 

Arg 

lie 

Asn 


130 





135 





140 





Ser 

Leu 

Asp 

Ala 

Met 

Glu 

Thr 

Trp 

lie 

Ser 

Gin 

Gin 

Lys 

Glu 

Lys 

Lys 

145 





150 





155 





160 

Leu 

Tyr 

Leu 

Val 

Leu 

lie 

Ser 

Leu 

His 

Gly 

Leu 

lie 

Arg 

Gly 

Glu 

Asn 





165 





170 





175 


Met 

Glu 

Leu 

Gly 

Arg 

Asp 

Ser 

Asp 

Thr 

Gly 

Gly 

Gin 

Val 

Lys 

Tyr 

Val 




180 





185 





190 



Val 

Glu 

Leu 

Ala 

Arg 

Ala 

Leu 

Gly 

Ser 

Met 

Pro 

Gly 

Val 

Tyr 

Arg 

Val 



195 





200 





205 




Asp 

Leu 

Leu 

Thr 

Arg 

Gin 

Val 

Ser 

Ser 

Pro 

Asp 

Val 

Asp 

Trp 

Ser 

Tyr 


210 





215 





220 





Gly 

Glu 

Pro 

Thr 

Glu 

Met 

Leu 

Asn 

Pro 

Arg 

Asp 

Ser 

Asn 

Gly 

Phe 

Asp 

225 





230 





235 





240 

Asp 

Asp 

Asp 

Asp 

Glu 

Met 

Gly 

Glu 

Ser 

Ser 

Gly 

Ala 

Tyr 

lie 

Val 

Arg 





245 





250 





255 


lie 

Pro 

Phe 

Gly 

Pro 

Arg 

Asp 

Lys 

Tyr 

lie 

Ala 

Lys 

Glu 

Glu 

Leu 

Trp 




260 





265 





270 



Pro 

Tyr 

lie 

Pro 

Glu 

Phe 

Val 

Asp 

Gly 

Ala 

Leu 

Asn 

His 

lie 

Val 

Gin 



275 





280 





285 




Met 

Ser 

Lys 

Val 

Leu 

Gly 

Glu 

Gin 

lie 

Gly 

Ser 

Gly 

Glu 

Thr 

Val 

Trp 


290 





295 





300 





Pro 

Val 

Ala 

lie 

His 

Gly 

His 

Tyr 

Ala 

Asp 

Ala 

Gly 

Asp 

Ser 

Ala 

Ala 

305 





310 





315 





320 

Leu 

Leu 

Ser 

Gly 

Gly 

Leu 

Asn 

Val 

Pro 

Met 

Leu 

Leu 

Thr 

Gly 

His 

Ser 





325 





330 





335 


Leu 

Gly 

Arg 

Asp 

Lys 

Leu 

Glu 

Gin 

Leu 

Leu 

Lys 

Gin 

Gly 

Arg 

Met 

Ser 




340 





345 





350 



Lys 

Asp 

Asp 

lie 

Asn 

Asn 

Thr 

Tyr 

Lys 

lie 

Met 

Arg 

Arg 

lie 

Glu 

Ala 



355 





360 





365 




Glu 

Glu 

Leu 

Ser 

Leu 

Asp 

Ala 

Ser 

Glu 

lie 

Val 

lie 

Thr 

Ser 

Thr 

Arg 


370 





375 





380 





Gin 

Glu 

lie 

Glu 

Glu 

Gin 

Trp 

His 

Leu 

Tyr 

Asp 

Gly 

Phe 

Asp 

Pro 

Val 

385 





390 





395 





400 

Leu 

Glu 

Arg 

Lys 

Leu 

Arg 

Ala 

Arg 

Met 

Lys 

Arg 

Gly 

Val 

Ser 

Cys 

Tyr 





405 





410 





415 
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Gly Arg 

Phe 

Met 

Pro 

Arg 

Met 

Val 

Val 

lie 

Pro 

Pro 

Gly 

Met 

Glu 

Phe 




420 





425 





430 



Asn 

His 

lie 

Val 

Pro 

His 

Glu 

Gly 

Asp 

Met 

Asp 

Gly 

Glu 

Thr 

Glu 

Glu 



435 





440 





445 




Thr 

Glu 

Glu 

His 

Pro 

Thr 

Ser 

Pro 

Asp 

Pro 

Pro 

lie 

Trp 

Ala 

Glu 

lie 


450 





455 





460 





Met 

Arg 

Phe 

Phe 

Ser 

Lys 

Pro 

Arg 

Lys 

Pro 

Met 

lie 

Leu 

Ala 

Leu 

Ala 

465 





470 





475 





480 

Arg 

Pro 

Asp 

Pro 

Lys 

Lys 

Asn 

lie 

Thr 

Thr 

Leu 

Val 

Lys 

Ala 

Phe 

Gly 





485 





490 





495 


Glu 

Cys 

Arg 

Pro 

Leu 

Arg 

Glu 

Leu 

Ala 

Asn 

Leu 

Thr 

Leu 

lie 

Met 

Gly 




500 





505 





510 



Asn 

Arg 

Asp 

Gly 

lie 

Asp 

Glu 

Met 

Ser 

Ser 

Thr 

Ser 

Ser 

Ser 

Val 

Leu 



515 





520 





525 




Leu 

Ser 

Val 

Leu 

Lys 

Leu 

lie 

Asp 

Gin 

Tyr 

Asp 

Leu 

Tyr 

Gly 

Gin 

Val 


530 





535 





540 





Ala 

Tyr 

Pro 

Lys 

His 

His 

Lys 

Gin 

Ala 

Asp 

Val 

Pro 

Glu 

lie 

Tyr 

Arg 

545 





550 





555 





560 

Leu 

Ala 

Ala 

Lys 

Thr 

Lys 

Gly 

Val 

Phe 

lie 

Asn 

Pro 

Ala 

Phe 

lie 

Glu 





565 





570 





575 


Pro 

Phe 

Gly 

Leu 

Thr 

Leu 

lie 

Glu 

Ala 

Ala 

Ala 

His 

Gly 

Leu 

Pro 

Met 




580 





585 





590 



Val 

Ala 

Thr 

Lys 

Asn 

Gly 

Gly 

Pro 

Val 

Asp 

lie 

Gin 

Arg 

Val 

Leu 

Asp 



595 





600 





605 




Asn 

Gly 

Leu 

Leu 

Val 

Asp 

Pro 

His 

Glu 

Gin 

Gin 

Ser 

lie 

Ala 

Thr 

Ala 


610 





615 





620 





Leu 

Leu 

Lys 

Leu 

Val 

Ala 

Asp 

Lys 

Gin 

Leu 

Trp 

Thr 

Lys 

Cys 

Gin 

Gin 

625 





630 





635 





640 

Asn 

Gly 

Leu 

Lys 

Asn 

lie 

His 

Leu 

Tyr 

Ser 

Trp 

Pro 

Glu 

His 

Ser 

Lys 





645 





650 





655 


Thr 

Tyr 

Leu 

Ser 

Arg 

lie 

Ala 

Ser 

Ser 

Arg 

Gin 

Arg 

Gin 

Pro 

Gin 

Trp 




660 





665 





670 



Gin 

Arg 

Ser 

Ser 

Asp 

Glu 

Gly 

Leu 

Asp 

Asn 

Gin 

Glu 

Pro 

Glu 

Ser 

Pro 



675 





680 





685 




Ser 

Asp 

Ser 

Leu 

Arg 

Asp 

lie 

Lys 

Asp 

lie 

Ser 

Leu 

Asn 

Leu 

Glu 

Val 


690 





695 





700 





Leu 

Val 

Arg 

Pro 

Glu 

Lys 

Arg 

Val 

Lys 

Thr 

Leu 

Lys 

lie 

Leu 

Gly 

Leu 

705 





710 





715 





720 

Met 

Thr 

Lys 

Ala 

Asn 

Ser 

Arg 

Met 

Leu 

Leu 

Cys 

Ser 

Trp 

Ser 

Asn 

Gly 





725 





730 





735 


Val 

His 

Lys 

Met 

Leu 

Arg 

Lys 

Ala 

Arg 

Phe 

Ser 

Asp 

Lys 

Val 

Asp 

Gin 




740 





745 





750 



Ala 

Ser 

Ser 

Lys 

Tyr 

Pro 

Ala 

Phe 

Arg 

Arg 

Arg 

Lys 

Leu 

lie 

Tyr 

Val 



755 





760 





765 




He 

Ala 

Val 

Asp 

Gly 

Asp 

Tyr 

Glu 

Asp 

Gly 

Leu 

Phe 

Asp 

lie 

Val 

Arg 


770 





775 





780 





Arg 

lie 

Phe 

Asp 

Ala 

Ala 

Gly 

Lys 

Glu 

Lys 

lie 

Glu 

Gly 

Ser 

lie 

Gly 

785 





790 





795 





800 

Phe 

lie 

Leu 

Ser 

Thr 

Ser 

Tyr 

Ser 

Met 

Pro 

Glu 

lie 

Gin 

Asn 

Tyr 

Leu 





805 





810 





815 


Leu 

Ser 

Lys 

Gly 

Phe 

Asn 

Leu 

His 

Asp 

Phe 

Asp 

Ala 

Tyr 

lie 

Cys 

Asn 




820 





82 5 





830 
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Ser 

Gly 

Ser 

Glu 

Leu 

Tyr 

Tyr 

Ser 

Ser 

Leu 

Asn 

Ser 

Glu 

Glu 

Ser 

Asn 



835 





84 0 





845 




He 

lie 

Ala 

Asp 

Ser 

Asp 

Tyr 

His 

Ser 

His 

lie 

Glu 

Tyr 

Arg 

Trp 

Gly 


850 





855 





860 





Gly 

Glu 

Gly 

Leu 

Arg 

Arg 

Thr 

Leu 

Leu 

Arg 

Trp 

Ala 

Ala 

Ser 

lie 

Thr 

865 





870 





875 





880 

Glu 

Lys 

Asn 

Gly 

Glu 

Asn 

Glu 

Glu 

Gin 

Val 

lie 

Thr 

Glu 

Asp 

Glu 

Glu 





885 





890 





895 


Val 

Ser 

Thr 

Gly 

Tyr 

Cys 

Phe 

Ala 

Phe 

Lys 

lie 

Lys 

Asn 

Gin 

Asn 

Lys 




900 





905 





910 



Val 

Pro 

Pro 

Thr 

Lys 

Glu 

Leu 

Arg 

Lys 

Ser 

Met 

Arg 

lie 

Gin 

Ala 

Leu 



915 





920 





925 




Arg 

Cys 

His 

Val 

lie 

Tyr 

Cys 

Gin 

Asn 

Gly 

Ser 

Lys 

Met 

Asn 

Val 

lie 


930 





935 





940 





Pro 

Val 

Leu 

Ala 

Ser 

Arg 

Ser 

Gin 

Ala 

Leu 

Arg 

Tyr 

Leu 

Tyr 

Val 

Arg 

945 





950 





955 





960 

Trp 

Gly 

Val 

Glu 

Leu 

Ser 

Lys 

Met 

Val 

Val 

Phe 

Val 

Gly 

Glu 

Cys 

Gly 





965 





970 





975 


Asp 

Thr 

Asp 

Tyr 

Glu 

Gly 

Leu 

Leu 

Gly 

Gly 

Val 

His 

Lys 

Thr 

Val 

lie 


980 985 990 


Leu Lys Gly Val Ser Asn Thr Ala Leu Arg Ser Leu His Ala Asn Arg 
995 1000 1005 


Ser 

Tyr 

1010 

Pro 

Leu 

Ser 

His 

Val 

1015 

Val 

Ser 

Leu Asp 

Ser 

1020 

Pro 

Asn 

lie 

Gly 

Glu 

1025 

Val 

Ser 

Lys 

Gly 

Cys 

1030 

Ser 

Ser 

Ser Glu 

lie 

1035 

Gin 

Ser 

lie 

Val 

Thr 

1040 

Lys 

Leu 

Ser 

Lys 

Ala 

1045 









<210 > SEQ ID NO 8 
<211 > LENGTH: 1060 
<212 > TYPE: PRT 

<213 > ORGANISM: Saccharum hybrid cultivar ROC22 
<220 > FEATURE: 

<221 > NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: Sugarcane sucrose phosphate synthase (SPS) 
<400 > SEQUENCE: 8 


Met 

Ala 

Gly 

Asn 

Asp 

Trp 

lie 

Asn 

Ser 

Tyr 

Leu 

Glu 

Ala 

lie 

Leu 

Asp 

1 




5 





10 





15 


Ala 

Gly 

Gly 

Ala 

Ala 

Gly 

Glu 

lie 

Ser 

Ala 

Ala 

Ala 

Gly 

Ser 

Gly 

Gly 




20 





25 





30 



Gly 

Gly 

Asp 

Gly 

Thr 

Ala 

Gly 

Glu 

Lys 

Arg 

Asp 

Lys 

Ser 

Ser 

Leu 

Met 



35 





40 





45 




Leu 

Arg 

Glu 

Arg 

Gly 

Arg 

Phe 

Asn 

Pro 

Ala 

Arg 

Tyr 

Phe 

Val 

Glu 

Glu 


50 





55 





60 





Val 

lie 

Ser 

Gly 

Phe 

Asp 

Glu 

Thr 

Asp 

Leu 

Tyr 

Lys 

Thr 

Trp 

Val 

Arg 

65 





70 





75 





80 

Thr 

Ser 

Ala 

Met 

Arg 

Ser 

Pro 

Gin 

Glu 

Arg 

Asn 

Thr 

Arg 

Leu 

Glu 

Asn 





85 





90 





95 


Met 

Ser 

Trp 

Arg 

lie 

Trp 

Asn 

Leu 

Ala 

Arg 

Lys 

Lys 

Lys 

Gin 

lie 

Lys 




100 





105 





110 



Gly 

Glu 

Glu 

Ala 

Ser 

Arg 

Leu 

Ser 

Lys 

Arg 

Arg 

Met 

Glu 

Leu 

Glu 

Lys 



115 





12 0 





125 




Ala 

Arg 

Gin 

Tyr 

Ala 

Ala 

Thr 

Asp 

Leu 

Ser 

Glu 

Asp 

Leu 

Ser 

Glu 

Gly 


130 





135 





140 
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Glu 

Lys 

Gly 

Glu 

Thr 

Asn 

Asn 

Glu 

Pro 

Ser 

lie 

His 

Asp 

Glu 

Ser 

Met 

145 





150 





155 





160 

Arg 

Thr 

Arg 

Met 

Pro 

Arg 

lie 

Gly 

Ser 

Thr 

Asp 

Ala 

lie 

Glu 

Thr 

Trp 





165 





170 





175 


Ala 

Asn 

Gin 

His 

Lys 

Asp 

Lys 

Lys 

Leu 

Tyr 

lie 

Val 

Leu 

lie 

Ser 

lie 




180 





185 





190 



His 

Gly 

Leu 

lie 

Arg 

Gly 

Glu 

Asn 

Met 

Glu 

Leu 

Gly 

Arg 

Asp 

Ser 

Asp 



195 





200 





205 




Thr 

Gly 

Gly 

Gin 

Val 

Lys 

Tyr 

Val 

Val 

Glu 

Leu 

Ala 

Arg 

Ala 

Leu 

Gly 


210 





215 





220 





Ser 

Thr 

Pro 

Gly 

Val 

Tyr 

Arg 

Val 

Asp 

Leu 

Leu 

Thr 

Arg 

Gin 

lie 

Ser 

225 





230 





235 





240 

Ala 

Pro 

Asp 

Val 

Asp 

Trp 

Ser 

Tyr 

Gly 

Glu 

Pro 

Thr 

Glu 

Met 

Leu 

Ser 





245 





250 





255 


Pro 

lie 

Ser 

Ser 

Glu 

Asn 

Phe 

Gly 

His 

Glu 

Leu 

Gly 

Glu 

Ser 

Ser 

Gly 




260 





265 





270 



Ala 

Tyr 

lie 

Val 

Arg 

lie 

Pro 

Phe 

Gly 

Pro 

Arg 

Asp 

Lys 

Tyr 

lie 

Pro 



275 





280 





285 




Lys 

Glu 

His 

Leu 

Trp 

Pro 

His 

lie 

Gin 

Glu 

Phe 

Val 

Asp 

Gly 

Ala 

Leu 


290 





295 





300 





Val 

His 

lie 

Met 

Gin 

Met 

Ser 

Lys 

Val 

Leu 

Gly 

Glu 

Gin 

lie 

Gly 

Ser 

305 





310 





315 





320 

Gly 

Gin 

Pro 

Val 

Trp 

Pro 

Val 

Val 

lie 

His 

Gly 

His 

Tyr 

Ala 

Asp 

Ala 





325 





330 





335 


Gly Asp 

Ser 

Ala 

Ala 

Leu 

Leu 

Ser 

Gly 

Ala 

Leu 

Asn 

Val 

Pro 

Met 

Val 




340 





345 





350 



Phe 

Thr 

Gly 

His 

Ser 

Leu 

Gly 

Arg 

Asp 

Lys 

Leu 

Glu 

Gin 

lie 

Leu 

Lys 



355 





360 





365 




Gin 

Gly 

Arg 

Gin 

Thr 

Arg 

Asp 

Glu 

lie 

Asn 

Ala 

Thr 

Tyr 

Lys 

lie 

Met 


370 





375 





380 





Arg 

Arg 

lie 

Glu 

Ala 

Glu 

Glu 

Leu 

Cys 

Leu 

Asp 

Thr 

Ser 

Glu 

lie 

lie 

385 





390 





395 





400 

He 

Thr 

Ser 

Thr 

Arg 

Gin 

Glu 

lie 

Glu 

Gin 

Gin 

Trp 

Gly 

Leu 

Tyr 

Asp 





405 





410 





415 


Gly 

Phe 

Asp 

Leu 

Thr 

Met 

Ala 

Arg 

Lys 

Leu 

Arg 

Ala 

Arg 

lie 

Lys 

Arg 




420 





425 





430 



Gly 

Val 

Ser 

Cys 

Phe 

Gly 

Arg 

Tyr 

Met 

Pro 

Arg 

Met 

lie 

Ala 

lie 

Pro 



435 





440 





445 




Pro 

Gly 

Met 

Glu 

Phe 

Ser 

His 

lie 

Ala 

Pro 

His 

Asp 

Val 

Asp 

Leu 

Asp 


450 





455 





460 





Ser 

Glu 

Glu 

Gly 

Asn 

Glu 

Asp 

Gly 

Ser 

Gly 

Ser 

Pro 

Asp 

Pro 

Pro 

lie 

465 





470 





475 





480 

Trp 

Ala 

Asp 

lie 

Met 

Arg 

Phe 

Phe 

Ser 

Asn 

Pro 

Arg 

Lys 

Pro 

Met 

lie 





485 





490 





495 


Leu 

Ala 

Leu 

Ala 

Arg 

Pro 

Asp 

Pro 

Lys 

Lys 

Asn 

lie 

Thr 

Thr 

Leu 

Val 




500 





505 





510 



Lys 

Ala 

Phe 

Gly 

Glu 

His 

Arg 

Glu 

Leu 

Arg 

Asn 

Leu 

Ala 

Asn 

Leu 

Thr 



515 





52 0 





525 




Leu 

lie 

Met 

Gly 

Asn 

Arg 

Asp 

Val 

lie 

Asp 

Glu 

Met 

Ser 

Ser 

Thr 

Asn 


530 





535 





540 





Ala 

Ala 

Val 

Leu 

Thr 

Ser 

Val 

Leu 

Lys 

Leu 

lie 

Asp 

Lys 

Tyr 

Asp 

Leu 


545 550 555 560 
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-continued 


Tyr 

Gly 

Gin 

Val 

Ala 

Tyr 

Pro 

Lys 

His 

His 

Lys 

Gin 

Phe 

Glu 

Val 

Pro 





565 





570 





575 


Asp 

lie 

Tyr 

Arg 

Leu 

Ala 

Ala 

Arg 

Thr 

Lys 

Gly 

Val 

Phe 

lie 

Asn 

Cys 




580 





585 





590 



Ala 

Phe 

lie 

Glu 

Pro 

Phe 

Gly 

Leu 

Thr 

Leu 

lie 

Glu 

Ala 

Ala 

Ala 

Tyr 



595 





600 





605 




Gly 

Leu 

Pro 

lie 

Val 

Ala 

Thr 

Arg 

Asn 

Gly 

Gly 

Pro 

Val 

Asp 

lie 

His 


610 





615 





620 





Arg 

Val 

Leu 

Asp 

Asn 

Gly 

lie 

Leu 

Val 

Asp 

Pro 

His 

Asn 

Gin 

Asn 

Glu 

625 





630 





635 





640 

He 

Gly 

Glu 

Ala 

Leu 

Tyr 

Lys 

Leu 

Val 

Ser 

Asp 

Lys 

Gin 

Leu 

Trp 

Thr 





645 





650 





655 


Arg 

Cys 

Arg 

Gin 

Asn 

Gly 

Leu 

Lys 

Asn 

lie 

His 

Gin 

Phe 

Ser 

Trp 

Pro 




660 





665 





670 



Glu 

His 

Cys 

Lys 

Asn 

Tyr 

Leu 

Ala 

Arg 

Val 

Val 

Thr 

Leu 

Lys 

Pro 

Arg 



675 





680 





685 




His 

Pro 

Arg 

Trp 

Gin 

Lys 

Asn 

Asp 

Val 

Ala 

Thr 

Glu 

lie 

Ser 

Glu 

Ala 


690 





695 





700 





Asp 

Ser 

Pro 

Glu 

Asp 

Ser 

Leu 

Arg 

Asp 

lie 

His 

Asp 

lie 

Ser 

Leu 

Asn 

705 





710 





715 





720 

Leu 

Gin 

Leu 

Ser 

Leu 

Asp 

Ser 

Glu 

Lys 

Ser 

Gly 

Ser 

Lys 

Glu 

Gly 

Asn 





725 





730 





735 


Ser 

Asn 

Thr 

Val 

Arg 

Arg 

His 

Leu 

Glu 

Asp 

Ala 

Val 

Gin 

Lys 

Leu 

Ser 




740 





74 5 





750 



Gly Val 

Ser 

Asp 

lie 

Lys 

Lys 

Asp 

Gly 

Pro 

Gly 

Glu 

Asn 

Gly 

Lys 

Trp 



755 





760 





765 




Pro 

Ser 

Leu 

Arg 

Arg 

Arg 

Lys 

His 

lie 

lie 

Val 

lie 

Ala 

Val 

Asp 

Ser 


770 





775 





780 





Val 

Gin 

Asp 

Ala 

Asp 

Phe 

Val 

Gin 

Val 

lie 

Lys 

Asn 

lie 

Phe 

Glu 

Ala 

785 





790 





795 





800 

Ser 

Ser 

Asn 

Glu 

Arg 

Ser 

Ser 

Gly 

Ala 

Val 

Gly 

Phe 

Val 

Leu 

Ser 

Thr 





805 





810 





815 


Ala 

Arg 

Ala 

lie 

Ser 

Glu 

lie 

His 

Ala 

Leu 

Leu 

lie 

Ser 

Gly 

Arg 

lie 




820 





825 





830 



Glu 

Ala 

Ser 

Asp 

Phe 

Asp 

Ala 

Phe 

lie 

Cys 

Asn 

Ser 

Gly 

Ser 

Asp 

Leu 



835 





840 





845 




Cys 

Tyr 

Pro 

Ser 

Ser 

Ser 

Ser 

Glu 

Asp 

Met 

Leu 

Ser 

Pro 

Ala 

Glu 

Leu 


850 





855 





860 





Pro 

Phe 

Met 

lie 

Asp 

Leu 

Asp 

Tyr 

His 

Ser 

Gin 

lie 

Glu 

Tyr 

Arg 

Trp 

865 





870 





875 





880 

Gly 

Gly 

Glu 

Gly 

Leu 

Arg 

Lys 

Thr 

Leu 

lie 

Arg 

Trp 

Ala 

Ala 

Glu 

Lys 





885 





890 





895 


Asn 

Asn 

Glu 

Ser 

Gly 

Gin 

Lys 

lie 

Leu 

Val 

Glu 

Asp 

Glu 

Glu 

Cys 

Ser 




900 





905 





910 



Ser 

Thr 

Tyr 

Cys 

lie 

Ser 

Phe 

Lys 

Val 

Ser 

Asn 

Thr 

Ala 

Ala 

Ala 

Pro 



915 





920 





925 




Pro 

Val 

Lys 

Glu 

lie 

Arg 

Arg 

Thr 

Met 

Arg 

lie 

Gin 

Ala 

Leu 

Arg 

Cys 


930 





935 





940 





His 

Val 

Leu 

Tyr 

Ser 

His 

Asp 

Gly 

Ser 

Lys 

Leu 

Asn 

Val 

lie 

Pro 

Val 

945 





950 





955 





960 

Leu 

Ala 

Ser 

Arg 

Ser 

Gin 

Ala 

Leu 

Arg 

Tyr 

Leu 

Tyr 

lie 

Arg 

Trp 

Gly 


965 970 975 


Val Glu Leu Ser Asn lie Thr Val lie Val Gly Glu Cys Gly Asp Thr 
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-continued 


980 985 990 

Asp Tyr Glu Gly Leu Leu Gly Gly Val His Lys Thr lie lie Leu Lys 
995 1000 1005 


Gly 

Ser 

1010 

Phe 

Asn 

Thr 

Ala 

Pro 

1015 

Asn 

Gin 

Val 

His 

Ala 

1020 

Asn 

Arg 

Ser 

Tyr 

Ser 

1025 

Leu 

Gin 

Asp 

Val 

Val 

1030 

Ser 

Phe 

Glu 

Lys 

Gin 

1035 

Gly 

lie 

Ser 

Ser 

He 

1040 

Glu 

Gly 

Tyr 

Gly 

Pro 

1045 

Asp 

Asn 

Leu 

Lys 

Ser 

1050 

Ala 

Leu 

Arg 

Gin 

Phe 

Gly 

lie 

Leu 

Lys 

Asp 










1055 1060 


<210 > SEQ ID NO 9 
<211 > LENGTH: 420 
<212 > TYPE: PRT 

<213 > ORGANISM: Saccharum officinarum 
<220 > FEATURE: 

<221> NAME / KEY : misc_feature 

<223 > OTHER INFORMATION: Sugarcane sucrose phosphate phosphatase (SPP) 
<400 > SEQUENCE: 9 


Met 

Asp 

Lys 

Leu 

Ser 

Gly 

Ser 

Val 

Arg 

Leu 

Met 

lie 

Val 

Ser 

Asp 

Leu 

1 




5 





10 





15 


Asp 

His 

Thr 

Met 

Val 

Asp 

His 

His 

Asp 

Glu 

Glu 

Asn 

Leu 

Ser 

Leu 

Leu 




20 





25 





30 



Arg 

Phe 

Gly 

Ala 

Leu 

Trp 

Glu 

Ser 

Val 

Tyr 

Cys 

Glu 

Asp 

Ser 

Leu 

Leu 



35 





40 





45 




Val 

Phe 

Ser 

Thr 

Gly 

Arg 

Ser 

Pro 

Thr 

Leu 

Tyr 

Lys 

Glu 

Leu 

Arg 

Lys 


50 





55 





60 





Glu 

Lys 

Pro 

Met 

Leu 

Thr 

Pro 

Asp 

lie 

Thr 

lie 

Met 

Ser 

Val 

Gly 

Thr 

65 





70 





75 





80 

Glu 

lie 

Thr 

Tyr 

Gly 

Glu 

Ala 

Met 

Val 

Pro 

Asp 

Asp 

Gly 

Trp 

Glu 

Gin 





85 





90 





95 


Tyr 

Leu 

Asn 

Asn 

Lys 

Trp 

Asp 

Arg 

Asn 

lie 

Val 

Val 

Glu 

Glu 

Thr 

Ala 




100 





105 





110 



Ser 

Phe 

Ser 

Glu 

Leu 

Lys 

Leu 

Gin 

Pro 

Glu 

Thr 

Glu 

Gin 

Arg 

Pro 

His 



115 





120 





125 




Lys 

Val 

Ser 

Phe 

Leu 

Val 

Asp 

Lys 

Lys 

Ser 

Ala 

Gin 

Glu 

Val 

lie 

Lys 


130 





135 





140 





Ser 

Val 

Ala 

Glu 

Arg 

Leu 

Asp 

Lys 

Arg 

Gly 

Leu 

Asp 

Ala 

Lys 

lie 

lie 

145 





150 





155 





160 

Tyr 

Ser 

Gly 

Gly 

Gin 

Asp 

Leu 

Asp 

lie 

Leu 

Pro 

Gin 

Gly 

Ala 

Gly 

Lys 





165 





170 





175 


Gly 

Gin 

Ala 

Leu 

Ala 

Tyr 

Leu 

Leu 

Lys 

Lys 

Leu 

Ser 

Ser 

Cys 

Gly 

Lys 




180 





185 





190 



Pro 

Pro 

Asn 

Asn 

Thr 

Leu 

Val 

Cys 

Gly 

Asp 

Ser 

Gly 

Asn 

Asp 

Ala 

Glu 



195 





200 





205 




Leu 

Phe 

Ser 

lie 

Pro 

Gly 

Val 

Met 

Val 

Ser 

Asn 

Ala 

Gin 

Glu 

Glu 

Leu 


210 





215 





220 





Leu 

Gin 

Trp 

Tyr 

Ala 

Glu 

Asn 

Ala 

Lys 

Asp 

Asn 

Pro 

Lys 

lie 

lie 

His 

225 





230 





235 





240 

Ala 

Asn 

Glu 

Arg 

Cys 

Ala 

Ala 

Gly 

lie 

lie 

Gin 

Ala 

lie 

Gly 

His 

Phe 


245 250 255 


Lys Leu Gly Pro Asn lie Ser Pro Arg Asp Val Asp Phe Pro Tyr Ala 
260 265 270 
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-continued 


Lys 

Glu 

Ala 

Ser 

Phe 

Lys 

Pro 

Ser 

Asp 

Ala 

Val 

Val 

Lys 

Phe 

Tyr 

Val 



275 





280 





285 




Leu 

Tyr 

Glu 

Lys 

Trp 

Arg 

Arg 

Ala 

Glu 

Val 

Pro 

Lys 

Ser 

Asp 

Ser 

Val 


290 





295 





300 





He 

Lys 

Tyr 

Phe 

Lys 

Asn 

lie 

Thr 

His 

Ala 

Asn 

Gly 

Val 

lie 

lie 

His 

305 





310 





315 





320 

Pro 

Ala 

Gly 

Leu 

Glu 

Leu 

Ser 

Leu 

His 

Ala 

Ser 

lie 

Asp 

Ala 

Leu 

Gly 





325 





330 





335 


Ser 

Cys 

Tyr 

Gly 

Asp 

Lys 

Gin 

Gly 

Lys 

Lys 

Tyr 

Arg 

Ala 

Trp 

Val 

Asp 




340 





345 





350 



Arg 

Leu 

Ala 

lie 

Thr 

Gin 

Thr 

Gly 

Ser 

Asp 

Ser 

Trp 

Val 

Val 

Arg 

Phe 



355 





360 





365 




Asp 

Leu 

Trp 

Glu 

Ser 

Glu 

Gly 

Asp 

Val 

Arg 

Val 

Cys 

Ser 

Leu 

Ser 

Ser 


370 





375 





380 





Leu 

Ala 

Leu 

Val 

Leu 

Lys 

Ala 

Glu 

Ser 

Pro 

Glu Gly 

Phe 

Val 

Leu 

Thr 

385 





390 





395 





400 

His 

lie 

Gin 

Lys 

Thr 

Trp 

Leu 

Asn 

Gly 

Tyr 

Ser 

Ser 

Gly 

Val 

Glu 

Gin 





405 





410 





415 



Ala Phe Lys Val 
420 


<210 > SEQ ID NO 10 
<211 > LENGTH: 720 
<212 > TYPE: PRT 

<213 > ORGANISM: Synechocyst is sp . 

<220 > FEATURE: 

<221 > NAME/KEY: misc_f eature 

<223 > OTHER INFORMATION: Synechocyst is sucrose phosphate synthase (SPS) 


<400 > SEQUENCE: 

10 












Met 

Ser 

Tyr 

Ser 

Ser 

Lys 

Tyr 

lie 

Leu 

Leu 

lie 

Ser 

Val 

His 

Gly 

Leu 

1 




5 





10 





15 


lie 

Arg 

Gly 

Glu 

Asn 

Leu 

Glu 

Leu 

Gly 

Arg Asp 

Ala 

Asp 

Thr 

Gly 

Gly 




20 





25 





30 



Gin 

Thr 

Lys 

Tyr 

Val 

Leu 

Glu 

Leu 

Ala 

Arg 

Ala 

Leu 

Val 

Lys 

Asn 

Pro 



35 





40 





45 




Gin 

Val 

Ala 

Arg 

Val 

Asp 

Leu 

Leu 

Thr 

Arg 

Leu 

lie 

Lys 

Asp 

Pro 

Lys 


50 





55 





60 





Val 

Asp 

Ala 

Asp 

Tyr 

Ala 

Gin 

Pro 

Arg 

Glu 

Leu 

lie 

Gly 

Asp 

Arg 

Ala 

65 





70 





75 





80 

Gin 

lie 

Val 

Arg 

lie 

Glu 

Cys 

Gly 

Pro 

Glu 

Glu 

Tyr 

lie 

Ala 

Lys 

Glu 





85 





90 





95 


Met 

Leu 

Trp 

Asp 

Tyr 

Leu 

Asp 

Asn 

Phe 

Ala 

Asp 

His 

Ala 

Leu 

Asp 

Tyr 




100 





105 





110 



Leu 

Lys 

Glu 

Gin 

Pro 

Glu 

Leu 

Pro 

Asp 

Val 

lie 

His 

Ser 

His 

Tyr 

Ala 



115 





120 





125 




Asp 

Ala 

Gly 

Tyr 

Val 

Gly 

Thr 

Arg 

Leu 

Ser 

His 

Gin 

Leu 

Gly 

lie 

Pro 


130 





135 





140 





Leu 

Val 

His 

Thr 

Gly 

His 

Ser 

Leu 

Gly 

Arg 

Ser 

Lys 

Arg 

Thr 

Arg 

Leu 

145 





150 





155 





160 

Leu 

Leu 

Ser 

Gly 

lie 

Lys 

Ala 

Asp 

Glu 

lie 

Glu 

Ser 

Arg 

Tyr 

Asn 

Met 





165 





170 





175 


Ala 

Arg 

Arg 

lie 

Asn 

Ala 

Glu 

Glu 

Glu 

Thr 

Leu 

Gly 

Ser 

Ala 

Ala 

Arg 




180 





185 





190 




Val lie Thr Ser Thr His Gin Glu lie Ala Glu Gin Tyr Ala Gin Tyr 
195 200 205 
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-continued 


Asp 

Tyr 

Tyr 

Gin 

Pro 

Asp 

Gin 

Met 

Leu 

Val 

lie 

Pro 

Pro 

Gly 

Thr 

Asp 


210 





215 





220 





Leu 

Glu 

Lys 

Phe 

Tyr 

Pro 

Pro 

Lys 

Gly 

Asn 

Glu 

Trp 

Glu 

Thr 

Pro 

lie 

225 





230 





235 





240 

Val 

Gin 

Glu 

Leu 

Gin 

Arg 

Phe 

Leu 

Arg 

His 

Pro 

Arg 

Lys 

Pro 

lie 

lie 





245 





250 





255 


Leu 

Ala 

Leu 

Ser 

Arg 

Pro 

Asp 

Pro 

Arg 

Lys 

Asn 

lie 

His 

Lys 

Leu 

lie 




260 





265 





270 



Ala 

Ala 

Tyr 

Gly 

Gin 

Ser 

Pro 

Gin 

Leu 

Gin 

Ala 

Gin 

Ala 

Asn 

Leu 

Val 



275 





280 





285 




He 

Val 

Ala 

Gly 

Asn 

Arg 

Asp 

Asp 

lie 

Thr 

Asp 

Leu 

Asp 

Gin 

Gly 

Pro 


290 





295 





300 





Arg 

Glu 

Val 

Leu 

Thr 

Asp 

Leu 

Leu 

Leu 

Thr 

lie 

Asp 

Arg 

Tyr 

Asp 

Leu 

305 





310 





315 





320 

Tyr 

Gly 

Lys 

Val 

Ala 

Tyr 

Pro 

Lys 

Gin 

Asn 

Gin 

Ala 

Glu 

Asp 

Val 

Tyr 





325 





330 





335 


Ala 

Leu 

Phe 

Arg 

Leu 

Thr 

Ala 

Leu 

Ser 

Gin 

Gly 

Val 

Phe 

lie 

Asn 

Pro 




340 





345 





350 



Ala 

Leu 

Thr 

Glu 

Pro 

Phe 

Gly 

Leu 

Thr 

Leu 

lie 

Glu 

Ala 

Ala 

Ala 

Cys 



355 





360 





365 




Gly Val 

Pro 

lie 

Val 

Ala 

Thr 

Glu 

Asp 

Gly 

Gly 

Pro 

Val 

Asp 

lie 

lie 


370 





375 





380 





Lys 

Asn 

Cys 

Gin 

Asn 

Gly 

Tyr 

Leu 

lie 

Asn 

Pro 

Leu 

Asp 

Glu 

Val 

Asp 

385 





390 





395 





400 

lie 

Ala 

Asp 

Lys 

Leu 

Leu 

Lys 

Val 

Leu 

Asn 

Asp 

Lys 

Gin 

Gin 

Trp 

Gin 





405 





410 





415 


Phe 

Leu 

Ser 

Glu 

Ser 

Gly 

Leu 

Glu 

Gly 

Val 

Lys 

Arg 

His 

Tyr 

Ser 

Trp 




420 





425 





430 



Pro 

Ser 

His 

Val 

Glu 

Ser 

Tyr 

Leu 

Glu 

Ala 

lie 

Asn 

Ala 

Leu 

Thr 

Gin 



435 





440 





445 




Gin 

Thr 

Ser 

Val 

Leu 

Lys 

Arg 

Ser 

Asp 

Leu 

Lys 

Arg 

Arg 

Arg 

Thr 

Leu 


450 





455 





460 





Tyr 

Tyr 

Asn 

Gly 

Ala 

Leu 

Val 

Thr 

Ser 

Leu 

Asp 

Gin 

Asn 

Leu 

Leu 

Gly 

465 





470 





475 





480 

Ala 

Leu 

Gin 

Gly 

Gly 

Leu 

Pro 

Gly 

Asp 

Arg 

Gin 

Thr 

Leu 

Asp 

Glu 

Leu 





485 





490 





495 


Leu 

Glu 

Val 

Leu 

Tyr 

Gin 

His 

Arg 

Lys 

Asn 

Val 

Gly 

Phe 

Cys 

lie 

Ala 




500 





505 





510 



Thr 

Gly 

Arg 

Arg 

Leu 

Asp 

Ser 

Val 

Leu 

Lys 

lie 

Leu 

Arg 

Glu 

Tyr 

Arg 



515 





520 





525 




lie 

Pro 

Gin 

Pro 

Asp 

Met 

Leu 

lie 

Thr 

Ser 

Met 

Gly 

Thr 

Glu 

lie 

Tyr 


530 





535 





540 





Ser 

Ser 

Pro 

Asp 

Leu 

lie 

Pro 

Asp 

Gin 

Ser 

Trp 

Arg 

Asn 

His 

lie 

Asp 

545 





550 





555 





560 

Tyr 

Leu 

Trp 

Asn 

Arg 

Asn 

Ala 

lie 

Val 

Arg 

lie 

Leu 

Gly 

Glu 

Leu 

Pro 





565 





570 





575 


Gly 

Leu 

Ala 

Leu 

Gin 

Pro 

Lys 

Glu 

Glu 

Leu 

Ser 

Ala 

Tyr 

Lys 

lie 

Ser 




580 





585 





590 



Tyr 

Phe 

Tyr 

Asp 

Ala 

Ala 

lie 

Ala 

Pro 

Asn 

Leu 

Glu 

Glu 

lie 

Arg 

Gin 


595 600 605 


Leu Leu His Lys Gly Glu Gin Thr Val Asn Thr lie lie Ser Phe Gly 
610 615 620 
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-continued 


Gin 

Phe 

Leu 

Asp 

lie 

Leu 

Pro 

lie 

Arg 

Ala 

Ser 

Lys 

Gly 

Tyr 

Ala 

Val 

625 





630 





635 





640 

Arg 

Trp 

Leu 

Ser 

Gin 

Gin 

Trp 

Asn 

lie 

Pro 

Leu 

Glu 

His 

Val 

Phe 

Thr 





645 





650 





655 


Ala 

Gly 

Gly 

Ser 

Gly 

Ala 

Asp 

Glu 

Asp 

Met 

Met 

Arg 

Gly 

Asn 

Thr 

Leu 




660 





665 





670 



Ser 

Val 

Val 

Val 

Ala 

Asn 

Arg 

His 

His 

Glu 

Glu 

Leu 

Ser 

Asn 

Leu 

Gly 



675 





680 





685 




Glu 

He 

Glu 

Pro 

lie 

Tyr 

Phe 

Ser 

Glu 

Lys 

Arg 

Tyr 

Ala 

Ala 

Gly 

lie 


690 





695 





700 





Leu 

Asp 

Gly 

Leu 

Ala 

His 

Tyr 

Arg 

Phe 

Phe 

Glu 

Leu 

Leu 

Asp 

Pro 

Val 


705 710 715 720 


<210 > SEQ ID NO 11 
<211 > LENGTH: 244 
<212 > TYPE: PRT 

<213 > ORGANISM: Synechocyst is sp . 

<220 > FEATURE: 

<221> NAME / KEY : misc_f eature 

<223 > OTHER INFORMATION: Synechocyst is sucrose-phosphate phosphatase 
(SPP) 


<400 > SEQUENCE: 11 


Met 

Arg 

Gin 

Leu 

Leu 

Leu 

lie 

Ser 

Asp 

Leu Asp 

Asn 

Thr 

Trp 

Val 

Gly 

1 




5 





10 





15 


Asp 

Gin 

Gin 

Ala 

Leu 

Glu 

His 

Leu 

Gin 

Glu 

Tyr 

Leu 

Gly 

Asp 

Arg 

Arg 




20 





25 





30 



Gly Asn 

Phe 

Tyr 

Leu 

Ala 

Tyr 

Ala 

Thr 

Gly Arg 

Ser 

Tyr 

His 

Ser 

Ala 



35 





40 





45 




Arg 

Glu 

Leu 

Gin 

Lys 

Gin 

Val 

Gly 

Leu 

Met 

Glu 

Pro 

Asp 

Tyr 

Trp 

Leu 


50 





55 





60 





Thr 

Ala 

Val 

Gly 

Ser 

Glu 

lie 

Tyr 

His 

Pro 

Glu 

Gly 

Leu 

Asp 

Gin 

His 

65 





70 





75 





80 

Trp 

Ala 

Asp 

Tyr 

Leu 

Ser 

Glu 

His 

Trp 

Gin Arg 

Asp 

lie 

Leu 

Gin 

Ala 





85 





90 





95 


lie 

Ala 

Asp 

Gly 

Phe 

Glu 

Ala 

Leu 

Lys 

Pro 

Gin 

Ser 

Pro 

Leu 

Glu 

Gin 




100 





105 





110 



Asn 

Pro 

Trp 

Lys 

lie 

Ser 

Tyr 

His 

Leu 

Asp 

Pro 

Gin 

Ala 

Cys 

Pro 

Thr 



115 





120 





125 




Val 

lie 

Asp 

Gin 

Leu 

Thr 

Glu 

Met 

Leu 

Lys 

Glu 

Thr 

Gly 

lie 

Pro 

Val 


130 





135 





140 





Gin 

Val 

lie 

Phe 

Ser 

Ser 

Gly 

Lys 

Asp 

Val 

Asp 

Leu 

Leu 

Pro 

Gin 

Arg 

145 





150 





155 





160 

Ser 

Asn 

Lys 

Gly 

Asn 

Ala 

Thr 

Gin 

Tyr 

Leu 

Gin 

Gin 

His 

Leu 

Ala 

Met 





165 





170 





175 


Glu 

Pro 

Ser 

Gin 

Thr 

Leu 

Val 

Cys 

Gly 

Asp 

Ser 

Gly 

Asn 

Asp 

lie 

Gly 




180 





185 





190 



Leu 

Phe 

Glu 

Thr 

Ser 

Ala 

Arg 

Gly 

Val 

lie 

Val 

Arg 

Asn 

Ala 

Gin 

Pro 



195 





200 





205 




Glu 

Leu 

Leu 

His 

Trp 

Tyr 

Asp 

Gin 

Trp 

Gly Asp 

Ser 

Arg 

His 

Tyr 

Arg 


210 





215 





220 





Ala 

Gin 

Ser 

Ser 

His 

Ala 

Gly 

Ala 

lie 

Leu 

Glu 

Ala 

lie 

Ala 

His 

Phe 

225 





230 





235 





240 


Asp Phe Leu Ser 


<210 > SEQ ID NO 12 




49 


US 8,993,303 B2 


50 


-continued 


<211 > LENGTH: 521 

<212 > TYPE: PRT 

<213 > ORGANISM: Zea mays 

<220 > FEATURE: 

<221> NAME / KEY : misc_feature 
<223 > OTHER INFORMATION: ZmSUTl 


<400 > SEQUENCE: 12 


Met 

Ala 

Arg 

Gly 

Asp 

Gly 

Glu 

Leu 

Glu 

Leu 

Ser 

Val 

Gly 

Val 

Arg 

Gly 

1 




5 





10 





15 


Thr 

Gly 

Gly 

Ala 

Ala 

Ala 

Ala 

Ala 

Ala 

Ala 

Asp 

His 

Val 

Ala 

Pro 

lie 




20 





25 





30 



Ser 

Leu 

Gly 

Arg 

Leu 

lie 

Leu 

Ala 

Gly 

Met 

Val 

Ala 

Gly 

Gly 

Val 

Gin 



35 





40 





45 




Tyr 

Gly 

Trp 

Ala 

Leu 

Gin 

Leu 

Ser 

Leu 

Leu 

Thr 

Pro 

Tyr 

Val 

Gin 

Thr 


50 





55 





60 





Leu 

Gly 

Leu 

Ser 

His 

Ala 

Leu 

Thr 

Ser 

Phe 

Met 

Trp 

Leu 

Cys 

Gly 

Pro 

65 





70 





75 





80 

He 

Ala 

Gly 

Leu 

Val 

Val 

Gin 

Pro 

Leu 

Val 

Gly 

Leu 

Tyr 

Ser 

Asp 

Arg 





85 





90 





95 


Cys 

Thr 

Ala 

Arg 

Trp 

Gly 

Arg 

Arg 

Arg 

Pro 

Phe 

lie 

Leu 

lie 

Gly 

Cys 




100 





105 





110 



Met 

Leu 

lie 

Cys 

Leu 

Ala 

Val 

lie 

Val 

Val 

Gly 

Phe 

Ser 

Ser 

Asp 

lie 



115 





12 0 





125 




Gly Ala 

Ala 

Leu 

Gly 

Asp 

Thr 

Lys 

Glu 

His 

Cys 

Ser 

Leu 

Tyr 

His 

Gly 


130 





135 





140 





Pro 

Arg 

Trp 

His 

Ala 

Ala 

lie 

Val 

Tyr 

Val 

Leu 

Gly 

Phe 

Trp 

Leu 

Leu 

145 





150 





155 





160 

Asp 

Phe 

Ser 

Asn 

Asn 

Thr 

Val 

Gin 

Gly 

Pro 

Ala 

Arg 

Ala 

Met 

Met 

Ala 





165 





170 





175 


Asp 

Leu 

Cys 

Gly 

His 

His 

Gly 

Pro 

Ser 

Ala 

Ala 

Asn 

Ser 

lie 

Phe 

Cys 




180 





185 





190 



Ser 

Trp 

Met 

Ala 

Leu 

Gly 

Asn 

lie 

Leu 

Gly 

Tyr 

Ser 

Ser 

Gly 

Ser 

Thr 



195 





200 





205 




Asn 

Asn 

Trp 

His 

Lys 

Trp 

Phe 

Pro 

Phe 

Leu 

Leu 

Thr 

Asn 

Ala 

Cys 

Cys 


210 





215 





220 





Glu 

Ala 

Cys 

Ala 

Asn 

Leu 

Lys 

Gly 

Ala 

Phe 

Leu 

Val 

Ala 

Val 

Val 

Phe 

225 





230 





235 





240 

Leu 

Val 

Met 

Cys 

Leu 

Thr 

Val 

Thr 

Leu 

Phe 

Phe 

Ala 

Asn 

Glu 

Val 

Pro 





245 





250 





255 


Tyr 

Arg 

Gly 

Asn 

Gin 

Asn 

Leu 

Pro 

Thr 

Lys 

Ala 

Asn 

Gly 

Glu 

Val 

Glu 




260 





265 





270 



Thr 

Glu 

Pro 

Ser 

Gly 

Pro 

Leu 

Ala 

Val 

Leu 

Lys 

Gly 

Phe 

Lys 

Asn 

Leu 



275 





280 





285 




Pro 

Thr 

Gly 

Met 

Pro 

Ser 

Val 

Leu 

Leu 

Val 

Thr 

Gly 

Leu 

Thr 

Trp 

Leu 


290 





295 





300 





Ser 

Trp 

Phe 

Pro 

Phe 

lie 

Leu 

Tyr 

Asp 

Thr 

Asp 

Trp 

Met 

Gly 

Arg 

Glu 

305 





310 





315 





320 

lie 

Tyr 

His 

Gly 

Asp 

Pro 

Lys 

Gly 

Ser 

Asn 

Ala 

Gin 

lie 

Ser 

Ala 

Phe 





325 





330 





335 


Asp 

Glu 

Gly 

Val 

Arg 

Val 

Gly 

Ser 

Phe 

Gly 

Leu 

Leu 

Leu 

Asn 

Ser 

lie 




340 





345 





350 



Val 

Leu 

Gly 

Phe 

Ser 

Ser 

Phe 

Leu 

lie 

Glu 

Pro 

Met 

Cys 

Arg 

Lys 

Val 


355 360 365 


Gly Pro Arg Val Val Trp Val Thr Ser Asn Phe Met Val Cys Val Ala 
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370 375 380 


Met 

Ala 

Ala 

Thr 

Ala 

Leu 

lie 

Ser 

Phe 

Trp 

Ser 

Leu 

Lys 

Asp 

Tyr 

His 

385 





390 





395 





400 

Gly 

Tyr 

Val 

Gin 

Asp 

Ala 

lie 

Thr 

Ala 

Ser 

Thr 

Ser 

lie 

Lys 

Ala 

Val 





405 





410 





415 


Cys 

Leu 

Val 

Leu 

Phe 

Ala 

Phe 

Leu 

Gly 

Val 

Pro 

Leu 

Ala 

lie 

Leu 

Tyr 




420 





425 





430 



Ser 

Val 

Pro 

Phe 

Ala 

Val 

Thr 

Ala 

Gin 

Leu 

Ala 

Ala 

Thr 

Lys 

Gly Gly 



435 





440 





445 




Gly 

Gin 

Gly 

Leu 

Cys 

Thr 

Gly 

Val 

Leu 

Asn 

lie 

Ser 

lie 

Val 

lie 

Pro 


450 





455 





460 





Gin 

Val 

He 

lie 

Ala 

Leu 

Gly 

Ala 

Gly 

Pro 

Trp 

Asp 

Ala 

Leu 

Phe 

Gly 

465 





470 





475 





480 

Lys 

Gly 

Asn 

lie 

Pro 

Ala 

Phe 

Gly 

Val 

Ala 

Ser 

Gly 

Phe 

Ala 

Leu 

lie 





485 





490 





495 


Gly 

Gly 

Val 

Val 

Gly 

Val 

Phe 

Leu 

Leu 

Pro 

Lys 

lie 

Ser 

Lys 

Arg 

Gin 




500 





505 





510 



Phe 

Arg 

Ala 

Val 

Ser 

Ala 

Gly 

Gly 

His 










515 





520 










<210 > SEQ ID NO 13 
<211 > LENGTH: 517 
<212 > TYPE: PRT 

<213 > ORGANISM: Saccharum hybrid cultivar Q117 
<220 > FEATURE: 

<221 > NAME/KEY: misc_f eature 

<223 > OTHER INFORMATION: Sucrose transporter 


<400 > SEQUENCE: 

13 












Met 

Ala 

Arg 

Gly 

Asp 

Gly 

Glu 

Leu 

Glu 

Leu 

Ser 

Val 

Gly Val 

Arg 

Gly 

1 




5 





10 





15 


Ala 

Gly 

Ala 

Ala 

Ala 

Ala 

Asp 

His 

Val 

Ala 

Pro 

lie 

Ser 

Leu Gly Arg 




20 





25 





30 



Leu 

lie 

Leu 

Ala 

Gly 

Met 

Val 

Ala 

Gly 

Gly 

Val 

Gin 

Tyr 

Gly 

Trp 

Ala 



35 





40 





45 




Leu 

Gin 

Leu 

Ser 

Leu 

Leu 

Thr 

Pro 

Tyr 

Val 

Gin 

Thr 

Leu Gly 

Leu 

Ser 


50 





55 





60 





His 

Ala 

Leu 

Thr 

Ser 

Phe 

Met 

Trp 

Leu 

Cys 

Gly 

Pro 

lie 

Ala 

Gly 

Leu 

65 





70 





75 





80 

Val 

Val 

Gin 

Pro 

Leu 

Val 

Gly 

Leu 

Tyr 

Ser 

Asp 

Arg 

Cys 

Thr 

Ala 

Arg 





85 





90 





95 


Trp 

Gly 

Arg 

Arg 

Arg 

Pro 

Phe 

lie 

Leu 

Thr 

Gly 

Cys 

lie 

Leu 

lie 

Ser 




100 





105 





110 



Leu 

Ala 

Val 

lie 

Val 

Val 

Gly 

Phe 

Ser 

Ser 

Asp 

lie 

Gly Ala 

Ala 

Leu 



115 





120 





125 




Gly Asp 

Thr 

Lys 

Glu 

His 

Cys 

Ser 

Leu 

Tyr 

His 

Gly 

Pro 

Arg 

Trp 

His 


130 





135 





140 





Ala 

Ala 

lie 

Val 

Tyr 

Val 

Leu 

Gly 

Phe 

Trp 

Leu 

Leu 

Asp 

Phe 

Ser 

Asn 

145 





150 





155 





160 

Asn 

Thr 

Val 

Gin 

Gly 

Pro 

Ala 

Arg 

Ala 

Met 

Met 

Ala 

Asp 

Leu 

Cys 

Gly 





165 





170 





175 


His 

His 

Gly 

Pro 

Ser 

Ala 

Ala 

Asn 

Ser 

lie 

Phe 

Cys 

Ser 

Trp 

Met 

Ala 




180 





185 





190 



Leu 

Gly 

Asn 

lie 

Leu 

Gly 

Tyr 

Ser 

Ser 

Gly 

Ser 

Thr 

Asn 

Asn 

Trp 

His 



195 





200 





205 
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Lys 

Trp 

Phe 

Pro 

Phe 

Leu 

Lys 

Thr 

Asn 

Ala 

Cys 

Cys 

Glu 

Ala 

Cys 

Ala 


210 





215 





220 





Asn 

Leu 

Lys 

Gly 

Ala 

Phe 

Leu 

Val 

Ala 

Val 

Val 

Phe 

Leu 

Val 

lie 

Cys 

225 





230 





235 





240 

Leu 

Ala 

Val 

Thr 

Leu 

lie 

Phe 

Ala 

Lys 

Glu 

Val 

Pro 

Tyr 

Arg 

Gly 

Asn 





245 





250 





255 


Glu 

Asn 

Leu 

Pro 

Thr 

Lys 

Ala 

Asn 

Gly 

Glu 

Val 

Glu 

Ala 

Glu 

Pro 

Thr 




260 





265 





270 



Gly 

Pro 

Leu 

Ala 

Val 

Leu 

Lys 

Gly 

Phe 

Lys 

Asn 

Leu 

Pro 

Pro 

Gly 

Met 



275 





280 





285 




Pro 

Ser 

Val 

Leu 

Leu 

Val 

Thr 

Gly 

Leu 

Thr 

Trp 

Leu 

Ser 

Trp 

Phe 

Pro 


290 





295 





300 





Phe 

He 

Leu 

Tyr 

Asp 

Thr 

Asp 

Trp 

Met 

Gly 

Arg 

Glu 

lie 

Tyr 

His 

Gly 

305 





310 





315 





320 

Asp 

Pro 

Lys 

Gly 

Ser 

Asn 

Ala 

Gin 

lie 

Ser 

Ala 

Phe 

Asn 

Glu 

Gly 

Val 





325 





330 





335 


Arg 

Val 

Gly 

Ala 

Phe 

Gly 

Leu 

Leu 

Leu 

Asn 

Ser 

lie 

lie 

Leu 

Gly 

Phe 




340 





345 





350 



Ser 

Ser 

Phe 

Leu 

lie 

Glu 

Pro 

Met 

Cys 

Arg 

Lys 

Leu 

Gly 

Pro 

Arg 

Val 



355 





360 





365 




Val 

Trp 

Val 

Thr 

Ser 

Asn 

Phe 

Met 

Val 

Cys 

Val 

Ala 

Met 

Ala 

Ala 

Thr 


370 





375 





380 





Ala 

Leu 

lie 

Ser 

Tyr 

Trp 

Ser 

Leu 

Lys 

Asp 

Tyr 

His 

Gly 

Tyr 

Val 

Gin 

385 





390 





395 





400 

Asp 

Ala 

lie 

Thr 

Ala 

Ser 

Thr 

Asn 

lie 

Lys 

Ala 

Val 

Cys 

Leu 

Val 

Leu 





405 





410 





415 


Phe 

Ala 

Phe 

Leu 

Gly 

Val 

Pro 

Leu 

Ala 

lie 

Leu 

Tyr 

Ser 

Val 

Pro 

Phe 




420 





425 





430 



Ala 

Val 

Thr 

Ala 

Gin 

Leu 

Ala 

Ala 

Thr 

Lys 

Gly 

Gly 

Gly Gin 

Gly 

Leu 



435 





440 





445 




Cys 

Thr 

Gly 

Val 

Leu 

Asn 

lie 

Ser 

lie 

Val 

lie 

Pro 

Gin 

Val 

lie 

lie 


450 





455 





460 





Ala 

Leu 

Gly 

Ala 

Gly 

Pro 

Trp 

Asp 

Ala 

Leu 

Phe 

Gly 

Lys 

Gly 

Asn 

lie 

465 





470 





475 





480 

Pro 

Ala 

Phe 

Gly 

Val 

Ala 

Ser 

Gly 

Phe 

Ala 

Leu 

lie 

Gly Gly 

Val 

Val 





485 





490 





495 


Gly 

Val 

Phe 

Leu 

Leu 

Pro 

Lys 

lie 

Ser 

Lys 

Arg 

Gin 

Phe 

Arg 

Ala 

Val 




500 





505 





510 



Ser 

Ala 

Gly 

Gly 

His 













515 


<210 > SEQ ID NO 14 
<211 > LENGTH: 699 
<212 > TYPE: DNA 

<213 > ORGANISM: Anabaena cylindrica 
<220 > FEATURE: 

<221> NAME/KEY: misc_feature 

<223 > OTHER INFORMATION: Anabaena cylindrica acyl-ACP reducatase (AR) 
coding sequence 

<400 > SEQUENCE: 14 


atgcagcagc 

ttgttgagca 

aattgaaaaa 

attgatttcc 

aaagtgaaga 

atacaaagac 

60 

gcatatagcc 

gtattaatgc 

aattgtgatt 

gaaggggaac 

aagaagccca 

tgataattac 

120 

attcaactgg 

cggaactgct 

gccagaaagt 

aaagacaacc 

tgattcgctt 

atcgaagatg 

180 

gaaagccgtc 

acaagaaagg 

atttgaagct 

tgtggacgca 

atttgcaggt 

cacaccagac 

240 




55 
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atgaagtttg 

caaaagagtt 

tttctcagga 

ctgcacaaaa 

attttcaaac 

tgcggccgca 

300 

gaaggtaaag 

ttgttacttg 

cttgctgatt 

caagctttaa 

ttatcgaatg 

ttttgcgatc 

360 

gcagcataca 

acatctacat 

tcccgtcgct 

gatgatttcg 

cccgcaaaat 

tacagaaggt 

420 

gtggtcaaag 

aagaatacag 

tcatctcaat 

tttggcgaag 

tttggcttca 

agaaaacttt 

480 

gcagaatcca 

aagctgaatt 

agaaacagct 

aaccgccaaa 

atcttcccct 

agtctggaag 

540 

atgctcaacc 

aagtagcaga 

tgatgcccac 

gtcttggcaa 

tggaaaaaga 

agccttagta 

600 

gaagatttca 

tgattcaata 

cggtgaggca 

ctaagtaata 

ttggcttcac 

aactcgtgat 

660 

attatgcgtc 

tctccgctta 

cggactcata 

cctgtctaa 



699 


<210 > SEQ ID NO 15 
<211 > LENGTH: 1020 
<212 > TYPE: DNA 

<213 > ORGANISM: Anabaena cylindrica 
<220 > FEATURE: 

<221> NAME / KEY : misc_feature 

<223> OTHER INFORMATION: Anabaena cylindrica aldehyde decarbonylase (AD) 
coding sequence 

<400 > SEQUENCE: 15 


atgtttggtc 

taattggaca 

tctgactagc 

ttagaacacg 

ctcaatccgt 

agctcaagaa 

60 

ttgggatacc 

cagaatatgc 

cgatcaaggg 

ctagactttt 

ggtgtagcgc 

cccgccgcaa 

120 

attgtcgatc 

acattaccgt 

taccagcatc 

accggacaaa 

aaattgaagg 

tcggtatgta 

180 

gaatcttgct 

ttttgccgga 

aatgctggca 

aatcgccgca 

ttaaagctgc 

aactcgcaaa 

240 

attctcaacg 

ccatggctca 

tgctcaaaag 

catggcattg 

atatcacggc 

tttaggtggg 

300 

ttttcttcaa 

ttatttttga 

gaacttcaat 

ttagagcagt 

ttagccaagt 

ccgaaacgtt 

360 

aaattagaat 

ttgaacgctt 

cacaacagga 

aatacccata 

cagcctacat 

catctgtcgg 

420 

caggtagagg 

aagcatctaa 

gcaattagga 

atagaattgt 

caaaagcaac 

tgtggctgtg 

480 

tgtggcgcta 

caggggatat 

tggcagtgca 

gttacccgct 

ggttagataa 

aaaaacagat 

540 

gtccaagaat 

tactcctcat 

agcccgtaac 

caagaacgtc 

ttcaagaact 

acaagcagaa 

600 

ttgggacggg 

gtaaaatcat 

gggtttacag 

gaagcattac 

cccaagccga 

tattgtagtt 

660 

tgggttgcta 

gtatgcctaa 

aggtgtagaa 

attgacccca 

ccgtactgaa 

acaaccttgt 

720 

ttgctgattg 

atggtggcta 

tcctaaaaac 

ttagggacaa 

aaattcagca 

tcctggcgtg 

780 

tatgtattaa 

atggtggaat 

tgtcgagcat 

tccctagata 

ttgactggaa 

aattatgaaa 

840 

attgtcaata 

tggatgtccc 

agcacgccag 

ttgtttgctt 

gttttgcgga 

atcaatgctg 

900 

ctggaatttg 

agaagttata 

cacaaacttt 

tcttggggtc 

gtaatcagat 

taccgtagat 

960 

aaaatggagc 

aaattggtcg 

ggtgtcaatt 

aaacacggtt 

ttagaccatt 

attagtttag 

1020 


<210 > SEQ ID NO 16 
<211 > LENGTH: 1844 
<212 > TYPE: DNA 

<213 > ORGANISM: Pinus sabiniana 
<220 > FEATURE: 

<221> NAME / KEY : misc_f eature 
<223 > OTHER INFORMATION: MboS 

<400 > SEQUENCE: 16 


atggctctgc 

tctctgtcgc 

accgctggct 

cccagatggt 

gcgtgcacaa 

atcgttggtc 

60 

acttctacca 

aggttaaggt 

tgtccgcaga 

acgatctcaa 

cttccatccg 

catgtgtcgg 

120 

ataaccactg 

aatccggtga 

aggcgtacag 

agacgcatag 

caaatcatca 

ttccaacctc 

180 
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tgggacgata 

atttcataca 

gtccctctca 

acgccttatg 

gggcaatttc 

gtaccatgaa 

240 

agtgctcaga 

aacttattgg 

agaagtaaaa 

gagatgatca 

attcaatctc 

gcttaaagat 

300 

ggagaattaa 

tcaccccctc 

caatgatctc 

cttatgcggc 

tctctatagt 

cgatagcatt 

360 

gaacgtttgg 

gaatcgatag 

gcatttcaaa 

agtgaaataa 

aatcagctct 

ggattatgtt 

420 

tacagttatt 

ggaacgaaaa 

aggcattggg 

tggggaagag 

atagtgttgt 

tgccgatctc 

480 

aactcaactg 

ccttggggct 

tcgaactcta 

cgactacacg 

gatacccggt 

gtcttcagat 

540 

gtgttacaac 

acttcaaaga 

acaaaaaggg 

cagtttgcat 

gttcggccat 

tcaaacagag 

600 

ggagagataa 

gaagtgttct 

caacttattt 

cgggcttccc 

aaattgcctt 

tccgggagag 

660 

aaagttatgg 

aagaggcaga 

agtcttctct 

acaatatatt 

taaaagaagc 

catactaaag 

720 

cttccggtct 

gcggtctttc 

acgagagata 

tcgtacgttc 

tggaatatgg 

ttggcatata 

780 

aatttgccaa 

gattggaagc 

aaggaactac 

atcgacgtat 

ttggagagga 

ccccatttat 

84 0 

ttgacgccaa 

atatgaagac 

ccaaaaactt 

ctagaacttg 

caaagttgga 

gttcaatatg 

900 

tttcactctt 

tacaacagca 

agagctaaag 

cttctctcca 

gatggtggaa 

agattcgggt 

960 

ttctctcaaa 

tgaccttccc 

tcggcatcgt 

cacgtggaat 

attacacttt 

ggcatcttgc 

1020 

attgatagtg 

aacctcaaca 

ttcttcgttc 

agacttggat 

ttgccaaaat 

ctttcatctt 

1080 

gccacggttc 

ttgacgatat 

ttacgacacc 

tttggcacga 

tggatgagct 

agaactcttc 

1140 

acggcggcag 

ttaagaggtg 

gcatccgtct 

gcgacggagt 

ggcttccaga 

atatatgaaa 

1200 

ggagtatata 

tggtgcttta 

cgaaaccgtt 

aacgaaatgg 

caggagaagc 

agaaaagtct 

1260 

caaggccgag 

acacgctcaa 

ctatggccga 

aatgctttgg 

aggcttatat 

tgatgcttct 

1320 

atggaagaag 

cgaagtggat 

tttcagtggt 

tttttgccaa 

catttgagga 

gtacctggat 

1380 

aacgggaaag 

ttagtttcgg 

ttatggcatt 

ggcacattgc 

aacccattct 

gacgttgggc 

1440 

attccctttc 

ctcatcacat 

cctacaagaa 

atagactttc 

cttccaggct 

caatgatgtg 

1500 

gcatcttcca 

ttctccgact 

aaaaggcgac 

attcacactt 

accaggctga 

gaggagccgt 

1560 

ggagaaaaat 

cttcgtgtat 

atcatgttat 

atggaagaga 

atcccgagtc 

aacagaggaa 

1620 

gatgcaatca 

atcatatcaa 

ctccatggtc 

gacaaattac 

tcaaggaact 

aaattgggag 

1680 

tatctgagac 

ctgatagcaa 

tgttccaatc 

acttccaaga 

aacatgcatt 

tgacattctg 

1740 

agagctttct 

accatctcta 

caaataccga 

gatggcttca 

gcgttgcgaa 

ctatgaaata 

1800 

aagaatttgg 

tcatgacaac 

cgtcattgag 

cctgtgcctt 

tat a 
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What is claimed is: 

1. A composition comprising an Anabaena spp. genetically 
engineered with at least one recombinant polynucleotide 
expression construct, wherein the at least one recombinant 
polynucleotide expression constmct comprises a nucleotide 
sequence encoding at least one enzyme, wherein the at least 
one enzyme increases production of a carbon based product 
of interest by the genetically engineered Anabaena spp. fol- 
lowing expression of the polynucleotide expression con- 
stmct, wherein saidAnabaena spp. is ethanol producing Ana- 
baena sp. PCC7120 (pZR672) strain deposited under ATCC 
accession number PTA-12833 or is linalool producing Ana- 
baena sp. PCC7120 (pZR808) strain deposited under ATCC 
accession number PTA-12832. 

2 . The composition of claim 1 wherein the Anabaena spp. 
is Anabaena PCC7120 (pZR672) strain deposited under 
ATCC accession number PTA-12833. 


50 3 . The composition of claim 1, wherein the Anabaena spp. 

is linalool producing Anabaena sp. PCC7120 (pZR808) 
strain deposited under ATCC accession number PTA-12832. 

4 . The composition of claim 1 wherein the Anabaena spp. 
has an up-regulated 2-C-methyl-D-erythritol 4-phosphate 

55 (MEP) pathway. 

5 . The composition of claim 4 wherein the up-regulated 
MEP pathway is up-regulated by expressing at least one gene 
responsible for control of the MEP pathway in the Anabaena 
spp. 

60 6. The composition of claim 1 wherein the at least one 

recombinant polynucleotide expression constmct further 
comprises a nucleotide sequence encoding ribulose-l,5-bis- 
phosphate carboxylase/oxygenase (RuBisCo). 

7. The composition of claim 6 wherein the at least one 

65 recombinant polynucleotide expression constmct comprising 
a nucleotide sequence encoding RuBisCo, further comprises 
a nucleotide sequence encoding RuBisCo activase. 
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8. The composition of claim 1 wherein the carbon based 
product of interest is ethanol. 

9 . The composition of claim 1 wherein the Anabaena spp. 
is combined with a photoautotrophic liquid media, and 
optionally, wherein said media contains no combined nitro- 5 
gen. 

10 . The composition of claim 1 wherein the carbon based 
product of interest is linalool (C 10 H 18 O). 



